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NON-SPECIFIC DNA AMPLIFICATION 

The present invention is a continuation-in-part of U.S. 
5 patent application for "DNA Amplification and Subtraction 
Techniques," Serial No. 208,512, filed June 17, 1988. 
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Field of the Invention 

The present invention relates to a method of non-specific 
DNA amplification, and to applications of the method for cDNA 
selection. 
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Background of the Invention 

The ability to isolate and clone cDNA fragments which are 
20 homologous to contiguous genomic regions has important appli- 
cations in medicine. As one example, isolating cDNAs which 
are all homologous to a DNA section containing a known but as 
yet unlocalized disease-causing gene would result in the 
isolation of that gene and flanking genes; this analysis would 
facilitate the process of localization and analysis of the 
disease-causing gene. A second important application is the 
possibility of identifying new genes which are linked to known 
markers ♦ 

As another example, homologies between two related virus- 
es may be utilized by binding to the genome of a first virus, 
cDNAs which are derived from a cell type infected with a 
second virus, then isolating those cDNA sequences which are 
homologous to the first viral genome. 
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Another hybridization method which has important applica- 
tions to medical research involves the selection of genomic or 
cDNA fragments which are either common or unique to two dif- 
ferent biological sources, e.g./ cell types, or cell types at 
5 different stages of activation or development. 

Standard methods for obtaining ■ cDNAs on the basis of 
their hybridization to genomic DNA material are often limited 
in the amount of genomic DNA or cDNA material which is avail- 
able , particularly when it is desired to clone a large number 
10 of different-sequence fragments obtained from the hybridiza- 
tion step. This limitation can result in loss of a large 
number of fragments whose abundance in the hybridization 
elution mixture is very low. 

15 Summary of the Invention 

It is therefore one object of the invention to provide a 
method of cloning cDNA species which are homologous to a 
region of contiguous genomic DNA and are selected from a 
mixture of cDNA species. The method involves obtaining the 

20 region of interest of contiguous genomic DNA and preparing 
cDNA species which contain end-terminal priming sequences. 
Single-stranded cDNA species are isolated on the basis of 
their hybridization to the region of contiguous genomic DNA. 
These isolated single-stranded cDNA species are mixed with DNA 

25 polymerase, all four deoxyribonucleotide triphosphates, and 
primers homologous to the cDNA end-terminal priming sequences. 
This mixture is then reacted under conditions to produce 
sequence- independent amplification of the single-stranded cDNA 
species. The amplified cDNA species resulting from this 

30 reaction are cloned into a vector. 

The method of the present invention can also include a 
second cDNA molecule isolation step. The single-stranded cDNA 
molecules are isolated on the basis of their homology to a 
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first set of selected genomic fragments and are then ampli- 
fied. The amplified cDNA species are denatured and a new set 
of single-stranded cDNA species are isolated on the basis of 
their hybridization to a second set of. selected genomic frag- 
5 ments. For example, the cDNA species may be rehybridized 
against the first set of selected genomic fragments for en- 
richment, or, hybridized against a different set of genomic 
fragments to select a subset of the cDNA species. 

In the preparation of cDNA species to contain end-ter- 
10 minal priming sequences, the above method may further include 
the amplification of the cDNA species- by sequence -independent 
amplification. 

The region of contiguous DNA in the above method may 
contain a known gene or region of interest. DNA fragments 

15 containing the known gene or region of interest may be ob- 
tained as follows. A DNA section containing the region of 
interest is size fractionated, the fraction containing the 
known coding sequence is identified, and the DNA from the 
identified fraction is isolated. One method to identify the 

20 DNA fragments of interest is by primer-initiated amplification 
in the presence of primers which are homologous to sequences 
in the region of interest. This isolated DNA can then be 
digested with a restriction endonuclease to cut the region of 
interest into a plurality of smaller fragments. 

25 in one embodiment of the above method, the contiguous 

genomic DNA section is bound to a solid support. 

It will be appreciated that the contiguous genomic DNA 
section can be contained in a number of cloning vectors, known 
to one of ordinary skill in the art, for cloning large seg- 

30 ments of DNA, for example, yeast artificial chromosomes. 

One embodiment of the above method utilizes a contiguous 
genomic DNA section from the bovine leukosis virus genome and 
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a cDNA library made from mRNA obtained from a cell line in- 
fected with a virus derived from cell line 10C9. 

Another application of the above method includes the 
means for identifying cDNAs which correspond to genes located 
5 in the same chromosomal region as a known gene or region of 
interest . The fragments containing • the genomic DNA region 
containing a known gene or region can be obtained by prepara- 
tive size fractionation of genomic DNA fragments containing 
the known gene. One method to identify the DNA fragments of 

10 interest is by adding primers homologous to the known gene, 
DNA polymerase, and all four deoxyribonucleotides, to the gel 
matrix and treating the gel matrix under conditions which 
promote amplification of the region of the known gene defined 
by the primers. Linkers, which are useful as primers for 

15 sequence-independent amplification, are then ligated to the 
ends. Redundant linkers on the ends of the molecules can be 
removed by digesting with the appropriate restriction enzyme. 
The DNA fragments are mixed with DNA polymerase, all four 
deoxyribonucleotides, and primers homologous to the linkers 

20 present on the ends of the DNA fragments, and reacted under 
conditions to produce sequence-independent amplification of 
the DNA fragments. 

The above described method can be used for genomic DNA 
regions containing such known genes as growth factor genes and 

25 growth factor receptor genes including the following: inter- 
leukins (e.g., IL-5 and IL-3) , GM-CSF, and erythropoietin. 
With each of these genes an appropriate. cDNA library is cho- 
sen, such as, a T-cell cDNA library when using IL-5, or a 
renal cDNA library when using erythropoietin. 

30 These and other objects and features of the invention 

will be more fully understood when the following detailed 
description of the invention is read in conjunction with the 
accompanying drawings. 
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Brief D scription of the Drawings 

Figure 1 is a flow diagram of the duplex amplification 
method of the invention; 

Figures 2A-2C show the sequences of three exemplary 
5 linkers used in practicing the invention, after attachment to 
opposite ends of blunt-ended duplex fragments (solid lines); 

Figure 3 illustrates one method for biotinylating ampli- 
fied cDNA fragments, in practicing the invention; 

Figure 4 is a flow diagram of the method of the invention 
10 for isolating DNA fragments which are unique to one fragment 
source; 

Figure 5 illustrates the isolation method shown in Figure 
4 applied to isolation of an RNA-derived sequence which is 
unique to individuals infected with a given viral agent; 
15 Figure 6 illustrates the amplification method of Figure 

1, as it is used for confirming the presence of ET-NANB viral 
agent in bile from an infected animal, and for assaying the 
presence of the viral agent in infected individuals; 

Figure 7 shows regional assignments of potential growth 
20 factor/receptor genes within a subregion of chromosome 5. The 
following designations are used: 

CSF-GM, granulocyte-macrophage colony stimulating 
factor; 

CD14, encodes an antigen which is a surface marker 

25 of monocytes; 

IL-3, multi-colony stimulating factor; 

IL4, B-cell growth factor; 

IL5, T-cell replacing factor; 

PDGFR, platelet-derived growth factor receptor; 
30 ADRBR, beta-2 adrenergic receptor; 

FGFA, fibroblast growth factor; 

CSF1R, CSF1 receptor (the protooncogene c-fms) 

CSF1, macrophage colony stimulating factor; 
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Figure 8 shows the steps in the preparation of small 
amplified genomic fragments derived from a region of human 
chromosome 5 containing the human IL-5 gene; 

Figures 9A and 9B show isolated genomic DNA from the 
5 HHW105 hamster/human hybrid cell line digested with Not I 
fractionated on a contour-clamped homogeneous electric field 
(CHEF) gel (9B), and sample products amplified with IL-5 
specific primers (9B); 

Figure 10 shows Southern blots of fragments amplified by 
10 IL-5-specific primers, which were fractionated on an agarose 
gel; 

Figure 11 is an agarose gel fractionation of the non- 
specific amplification products of genomic fragments derived 
from the chromosome 5 region surrounding the IL-5 coding 
15 region; 

Figure 12 is an agarose gel fractionation of T-cell 
cDNA's which hybridized with the genomic DNA fragments from 
Figure 11; 

Figure 13 shows the hybridization of T-cell cDNA's 
20 isolated on* the basis of homology to the IL-5 region of human 
chromosome 5 to a variety of probes, including probes specific 
for actin, IL-3 and IL-5 coding regions; 

Figure 14 shows the hybridization of T-cell cDNAs iso- 
lated on the basis of homology to the IL-3 region of human 
25 chromosome 5 to a variety of probes including probes specific 
for IL-3, GM-CSF and actin coding regions, and to probes for 
total human genomic DNA; 

Figure 15 shows long-range restriction maps, adapted 
from VanLeeuwen et al (1989), of the human chromosome 5 re- 
30 gions containing IL-5 and IL-3; and 

Figure 16 shows how the method of the invention may be 
applied to identifying specific disease-related loci and their 
gene products. 
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Detailed Description of the Invention 

I. Amplifying DNA Fragments 
A. Sources of DNA Fragments 
5 Figure 1 illustrates the method of amplifying duplex DNA 

fragments according to the invention-. The duplex fragments 
are typically present in a fragment mixture, and typically a 
mixture of cDNA fragments produced from messenger RNA (mRNA) 
transcript species, although genomic DNA fragments or linea- 

10 rized vectors or vector fragments are also suitable. 

Methods for isolating mRNA species from cellular or 
body-fluid samples, such as serum or bile, are well known. 
One method involves formation of a vanadyl-RNA complex, ex- 
traction of protein with chloroform/phenol, and precipitation 

15 with cold ethanol. In a second method, the RNA is extracted 
from a guanidinium isothiocyanate mixture with phenol, fol- 
lowed by a chloroform risoamyl alcohol extraction, and precipi- 
tation of RNA from the aqueous phase with cold ethanol. The 
reader is referred to Maniatis, pp. 188-198, and references 

20 cited therein for details. 

As a rule, eukaryotic mRNAs are characterized by a 3' 
poly A terminal sequence which allows isolation by affinity 
chromatography, using oligo-dT or poly U bound to a solid 
support. In addition, or alternatively, total isolated RNA 

25 can be further fractionated by density gradient centrifuga- 
tion, or agarose gel electrophoresis, to obtain a desired size 
fraction of RNA species. 

Production of duplex cDNAs from the isolated mRNA tran- 
scripts is by conventional oligo dT or random priming of first 

30 strand synthesis. The former method is advantageous where 
duplex cDNAs derived only from poly A RNAs are desired. In 
this method, the RNA transcripts are primed with oligo dT, to 
promote first-strand cDNA synthesis by reverse transcriptase 
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in the presence of all four deoxynucleotides . The 3' hairpin 
formed from the first-strand synthesis product is used to 
prime the synthesis of the second strand by E. coli DNA poly- 
merase I, according to well-known methods (Maniatis, pp. 
5 213216) . 

The random priming method of cDNA duplex formation is 
preferred where (a) duplex formation is not to be limited to 
poly A species, or (b) full-length duplex cDNAs are not re- 
quired. The method for first-strand cDNA synthesis utilizes 

10 an arbitrary sequence primer which is commercially available. 

Following second-strand synthesis, the cDNA fragments 
are preferably blunt-ended, for example, by filling in sticky 
ends with the large fragment of E. coli DNA polymerase I 
(Klenow fragment) as described below in the Materials section. 

15 Intact chromosomal DNAs can be isolated from agarose- 

immobilized cell preparations in those cases where chromosomal 
length DNAs are required. Genomic DNA from a selected cell 
source can also be isolated by standard procedures, which 
typically include successive phenol and phenol /chloroform 

20 extractions with ethanol precipitation. Genomic DNA fragments, 
either in a fragment mixture or a purified preparation, are 
also suitable for amplification, according to the method of 
the invention. The duplex DNA is fragmented preferably by 
partial or complete digestion with one or more selected re- 

25 strict ion endonucleases, although mechanical shearing may be 
employed. The fragmented genomic pieces may be size frac- 
tionated, or further treated to remove repetitive DNA. Other 
sources of double-stranded DNA fragments can include extra- 
chromosomal material/ e.g., mitochondrial DNA, double-stranded 

30 DNA viruses, or viruses which have as part of their life cycle 
a double-stranded intermediate, e.g., a retrovirus. 

Cellular sources of genomic DNA fragments or RNA tran- 
scripts used for producing cDNA fragments include cultured 
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cell lines, or isolated cells or cell types obtained from 
tissue (or whole organs or entire organisms) . Cell sources 
are of interest in a variety of subtraction techniques where 
it is desired to identify or isolate particular RNA tran- 
5 scripts or genomic material which are unique to one of two 
related cell sources. Body-fluid sources of DNA and/or mRNA 
transcripts are of interest primarily where the fluid is known 
or suspected to contain a viral agent or other microbe of 
interest. Example 4, for instance, describes cDNA fragment 

10 mixtures produced from RNA isolated from bile, taken before 
and after infection of cynomolgus monkeys with' enterically 
transmitted non-A/non-B (ET-NANB) hepatitis virus . 

Linearized or fragmented plasmid DNA, or fragmented 
phage DNA is another source of DNA fragments which one might 

15 wish to amplify. The vector DNA is obtained from purified 
plasmid or phage DNA according to conventional techniques, and 
linearized and/or fragmented by digestion with selected re- 
striction endonuclease(s) . Example 2 describes DNA fragments 
obtained by Haelll digestion of phiX174 phage and linearized 

20 piAN13 plasmid. 

B. Fragment Linkers 

According to an important feature of the invention, the 
duplex DNA fragments to be amplified are ligated at their 

25 opposite end to a linker, to provide a priming sequence for 
strand duplication, i.e. an end-terminal primer. The linker 
is preferably a short duplex DNA fragment, typically about 
20-30 basepairs in length, having a defined base pair 
sequence. One end of the linker is designed for enzymatic 

30 ligation to opposite fragment ends. Where the fragments are 
blunt-ended cDNAs or genomic fragments, this linker end is 
also blunt, for ligation to the fragment ends in the presence 
of T4 DNA ligase, as described in Example 2. Where the DNA 
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fragment mixture is formed by endonuclease digestion of 
genomic DNA, and produces fragments with sticky ends, these 
ends can either be filled in to generate blunt ends or the 
linkers can be prepared with complementary sticky ends at the 
5 fragment-ligation end. 

Since the ligation reaction involving the linkers also 
produces linker-to-linker ligation, the linkers are preferably 
designed so that dimers can be selectively cleaved by restric- 
tion endonuclease digestion. This can be done by constructing 

10 the blunt end of the linker with a sequence representing 
one-half of a selected restriction-site sequence. The re- 
striction site formed by blunt-end dimerization is preferably 
a rare cutter site, so that digestion of the fragments with 
the associated endonuclease, after linker addition to the DNA 

15 fragments, does not cleave a significant number of fragments 
internally, and thereby produce fragments with a linker at one 
end only. 

Figure 2 shows DNA fragments (solid lines) with exem- 
plary linkers A, B, or C attached to opposite fragment ends. 

20 As seen, each of the three linkers has one-half of an Nru l 
site at its blunt end, whereby linker dimers formed by blunt 
end ligation can be cleaved by Nru l digestion. It should be 
noted that linker ligation to the DNA fragments is direction- 
al, and linker self-ligation is limited to dimer formation, 

25 due to the one staggered end of the linker. The linkers are 
joined to blunt-ended fragments as indicated above, and as 
illustrated at the top in Figure 1 . It will be appreciated 
that analogous linkers with one sticky end for ligation to 
corresponding fragment sticky ends could be similarly em- 

30 ployed. However, these linkers have the disadvantage that 
recleavage for removal of linker dimers is not possible. 

Also as seen in Figure 2, the linkers preferably include 
one or more internal restriction sites at which the fragments 
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can be cleaved after amplification. The internal linker 
restriction site(s) can serve three purposes. First, where 
the amplified fragments are used in the DNA subtraction method 
described in Section D-l below, the restriction site can be 
5 used to cleave away a major portion of the linker in one of 
the two fragment mixtures, to prevent hybridization between 
linker regions in strands from the two different mixtures, 
where the same linker is used in the amplification of both 
fragment mixtures. Secondly, the linker cutting site(s) 

10 allows the amplified fragments (or the hybridized fragments in 
the Section Dl subtraction method) , to be equipped with de- 
sired sticky end sites for cloning into a cloning vector. 
Thirdly, as will be seen in Section D-l, the linker restric- 
tion site may be used to create a single-strand end, for 

15 purposes of biotinylating the fragments. In particular, an 
internal EcoRI site produces an AATT overhang which can be 
filled in with commercially available biotinylated dUTP and 
dATP nucleotides. 

Synthetic duplex oligonucleotide linkers having selected 

20 sequences, such as those shown for the three exemplary linkers 
in Figure 2, can be prepared using commercially available 
automated oligonucleotide synthesizers. Alternatively, custom 
designed synthetic oligonucleotides may be purchased, for 
example, from Synthetic Genetics (San Diego, CA) . 

25 

C. Fragment Amplification 

. The method steps described above are shown at the top in 
Figure 1. These steps include obtaining duplex fragments, 
blunt-ending and attaching a linker to opposite fragments ends 
30 to produce linker-carrying fragments, such as illustrated in 
Figure 2, and treating with a selected endonuclease to cut 
linker multimers at their blunt-end junctions. Alternatively, 
the linker dimers can be removed by fractionating the DNA by 
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gel filtration or agarose gel electrophoresis/ and eluting all 
DNAs larger than linker dimers. 

The linker-fragments from above are amplified by the 
repeated fragment duplication according to the following 
5 steps. First, the fragments are mixed with a large molar 
excess of a single-strand oligonucleotide primer, typically a 
10 6 -10 9 molar excess- The primer sequence is homologous to the 
fragment end linker; that is, the primer sequence and length 
is such as to promote hybridization to the complementary-lin- 

10 ker regions of the two strands under moderately stringent 
reannealing conditions. Another requirement is that the 
primer, when hybridized to the linker region of the denatured 
fragment strands, be capable of priming polymerase-catalyzed 
strand replication; that is, the internal end of the primer 

15 provides a free 3' -OH. In the case of the linkers shown in 
Figure 2, preferred primer sequences are: 

d (5 ' -GGAATTCGCGGCCGCTCG-3 ' ) for linker A; 
d (5' -GGTTGCGGCCGCTCG-3' ) for linker B; and 
20 d(5' -GACGCGTGAATTCTCG-3") for linker C. 

Single-strand oligonucleotide primers having the desired 
sequence are prepared by conventional methods or can be ob- 
tained from commercial laboratories, as above. 

25 The denatured fragments and primer are cooled, typically 

between about 37-60°C, to allow primer hybridization with the 
fragment linker ends. The cooling period is kept quite short, 
typically less than about 5 minutes, to minimize strand-strand 
hybridization which would be expected on long-term reanneal- 

30 ing. At this point, or at the primer-addition stage, the four 
deoxynucleotides and a DNA polymerase capable of catalyzing 
second-strand, primed replication are added, and the reaction 
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mixture is brought to a temperature suitable for enzymatic 
strand replication. 

In the method described in Example 2 below, the deoxynu- 
cleotldes and DNA polymerase were added prior to fragment 
5 denaturation. After heat denaturing, the mixture was annealed 
at 50°C for two minutes, then brought to 72°C for 5-12 minutes 
for primed, second-strand replication. The DNA polymerase 
used is Thermus aquaticus DNA polymerase (Taq DNA polymerase), 
which is relatively heat-stable at up to 95°C for brief peri- 

10 ods (Kaledin et al; Perkin Elmer Cetus) . 

It will be appreciated from the above, and from Figure 
1, that the single added primer hybridizes with the 3 '-end 
linker region in each DNA strand, and thus a doubling of 
fragment number occurs. The above replication procedure, 

15 which involves fragment denaturation by heating, cooling to 
form a fragment strand-primer complex, and second strand 
replication of the complex in the presence of DNA polymerase, 
is repeated until a desired concentration of fragments is 
achieved. In the above example, which employs a heat-stable 

20 Taq polymerase, the three replication steps are carried out 
simply by heating the fragment mixture to a denaturing tempe- 
rature (above the Tm of the fragments) , cooling briefly to 
allow fragment/ primer complex formation, and incubating for a 
period sufficient for second-strand synthesis. 

25 Example 2 illustrates the application of the fragment 

amplification method to a linearized DNA plasmid (piAN13) and 
to a fragment mixture produced by Haelll digestion of phage 
phiX174 fragments. 

30 D. Utility 

The amplification method is useful for DNA duplex frag- 
ment amplification where limited amounts of genomic or cDNA 
fragment material is available and/or as a simple method of 
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amplifying duplex DNA material where DNA sequences are un- 
known. For example , in the probe binding study reported in 
Example 4, the method was used to increase by several orders 
of magnitude, the number of cDNA fragments derived from bile 
5 infected with enterically-transmitted non-A/non-B hepatitis 
(ET-NANB) viral agent. The amplification allowed the rela- 
tively small amount of viral agent present in the bile to be 
easily identified as an amplified cDNA fraction. 

The method is useful for amplifying the amount of a 
10 fragment or fragments available for cloning, to enhance clon- 
ing efficiency. One advantage of the method for cloning is 
the ability to provide the amplified fragments with end clon- 
ing sites suitable for the cloning vector which has been 
selected. 

15 The method can be applied to purified, enriched, or 

subfractionated DNA fragments, for increasing the amount of 
fragment material available for further processing, e.g., 
further enrichment. For example, in the fragment-isolation 
method described in below, the isolated unique sequence frag- 

20 ments can be further enriched by additional amplification and 
selection* As another example, a DNA subtraction obtained by 
gel electrophoresis can be selectively amplified. Here the 
fragment mixture, prior to fractionation, is equipped with an 
end linker, and the selected fragment band is further iso- 

25 lated, either in situ within the gel, or after selective 
elution, by addition of primer, polymerase, and nucleotides, 
with, repeated heating and cooling replication steps. The 
following specific applications are illustrative. 

30 1 . Isolating Unique Sequences 

(a) Hybridizing Fragment Mixtures 

Figure 4 is a flow diagram of the method for isolating 
unique sequences from one of two fragment mixtures, according 
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to the invention. The two fragment mixtures are referred to 
herein as positive- and negative -source mixtures, and are 
derived from positive and negative sources of nucleic acid 
species , respectively. The positive source may be any cell, 
5 cell group, tissue, organ, subcellular fractions, or other 
defined source of RNA transcripts or genomic fragments, in- 
cluding vector fragments, which contain a number of different- 
sequence species. The negative source is similar to the posi- 
tive source, but lacks one or more positive source species. 

10 By way of example, the positive and negative sources may 

be cell or tissue samples which are , infected (positive source) 
and non-infected (negative source) with a microbial agent. 
Here the nucleic acid species associated with the infecting 
agent, either in the form of genomic material or RNA tran- 

15 script species, are unique to the infected (positive) source. 
Similarly, the positive and negative sources may be a body 
fluid, such as serum or bile, from infected and non-infected 
subjects, respectively. Where the present method is applied 
to the study of hereditary diseases, the positive source is 

20 typically a normal cell line or tissue capable of producing 
the full range of normal proteins associated with that cell, 
and the negative source is a genetically altered cell which is 
suspected of an altered or lost mRNA species. That is, the 
positive source will yield a normal mRNA species which is not 

25 present in the genetically altered, negative source. 

As another example, it is known that malignant transfor- 
mation in cells may be triggered by the activation of onco- 
genes, as evidenced by expression of new RNA transcripts 
associated with the malignant state. In applying the method 

30 of the invention to isolating and identifying such RNA spe- 
cies, the malignant cells or tissue serves as the positive 
source of unique (tumor-related) RNA species, and normal, 
non-transformed cells or tissue, as the negative source. 



WO 91/05861 



PCT/US90/05938 



18 

Other examples of negative and positive sources of transcripts 
or DNA duplex fragments are contemplated. 

The positive and negative sources shown in Figure 4 
yield mixtures of mRNA species, and these are converted to 
5 corresponding duplex DNA fragments as discussed in Section A 
above. The duplex fragments in the two mixtures are blunt- 
ended and separately ligated at opposite fragment ends to a 
linker , as described in Section B. The linker attached to the 
positive- source fragment is preferably different than that 

10 attached to the negative -source fragments. For example, in 
the method outlined in Example 2, the positive source frag- 
ments are ligated to linker A in Figure 2, and the negative- 
source material, with linker C. Attaching linkers with dif- 
ferent sequences to the two fragment mixtures avoids the 

15 problem of linker hybridization when the two mixtures are 
hybridized. Alternatively, the same linker may be attached to 
both fragment mixtures, if the linker contains a restriction 
site which allows substantially all of the linker to be re- 
moved by restriction endonuclease digestion from the nega- 

20 tive-source fragment mixture before hybridization. 

Each fragment mixture is then amplified by repeated 
replications, as described in Section 1C above. In a 
preferred embodiment, the amplified, negative-source fragment 
strands are biotinylated or otherwise equipped with a ligand 

25 which can be used to selectively bind and remove strands or 
duplex species containing the ligand, by affinity chromatog- 
raphy. A variety of methods for biotinylating duplex frag- 
ments are available. For example, biotin-labeled deoxynucleo- 
tides, such as Bio-ll-UTP and Bio-7-dATP, can be included in 

30 the amplification reaction mixture during the final rounds of 
replication. Duplex fragments are readily photobiotinylated 
by a standard procedure, as outlined in the protocol supplied 
by the manufacturer, Clontech (Palo Alto, CA) . In one pre- 
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ferred method, the amplified fragments are treated with an 
endonuclease, such as EcoRI, which produces a 5'-AATT-3' 
sticky end. The sticky end is filled in with Klenow fragment 
in the presence of Bio-ll-UTP and/or Bio-7-dATP, both of which 
5 are commercially available. 

The two fragment mixtures are • now combined and heat- 
denatured to single strand form. Preferably the negative- 
source fragments are present in substantial molar excess , 
typically about a tenfold excess, to insure that non-unique 

10 fragments in the positive-source mixture have a high probabi- 
lity of hybridizing with complementary strands from the nega- 
tive-source fragments. 

The positive- and negative-source fragments are hybri- 
dized under conditions in which hybridization between the 

15 linkers in the two different mixtures does not occur. This 
may be accomplished, as noted above, by using differentse- 
quence linkers in the two mixtures, by cleaving the linkers 
from the negative- source fragment mixture prior to hybridiza- 
tion, or by hybridizing the fragment strands under conditions 

20 which preclude linker hybridization. Alternatively, the 
hybridization mixture may contain a large molar excess of 
primer oligonucleotides effective to hybridize with the posi- 
tive-source linker end regions, thus preventing inter-strand 
linker hybridization. 

25 The hybridization reaction may be carried out according 

to classical hybridization techniques, such as detailed in 
Hames, or more preferably, by rapid hybridization techniques, 
such as the phenol emulsion reassociation technique (PERT) , as 
described by Kohne, or the guanidinium thiocyanate method, as 

30 described by Thompson. 

In the PERT hybridization method, the fragments are 
hybridized at a final DNA concentration of preferably between 
about 5-50 ug/ml in a phenol reaction mixture, and the reac- 
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tion is carried out at 60-70°C for up to 5-6 hours* Alterna- 
tively/ the reaction can be carried out at room temperature in 
the presence of formamide. The rate of annealing can be fol- 
lowed by hyperchromic shift at 280 nm or by diluting an ali- 
5 quot of the reaction mixture with buffer, and quickly passing 
the material through a hydroxyapatite (HAP) column (Kohne) . 

In the guanidine thiocyanate method, the fragments are 
hybridized at a final DNA concentration of preferably between 
about 5-50 ug/ral in a 4M guanidine thiocyanate solution con- 
10 taining 8 mM DTT, and 20 mM Na citrate, pH 5.8* The mixture 
is heated to 65°C for 5 minutes, to insure complete denatura- 
tion, then incubated at 26°C for periods up to 72 hours, until 
complete hybridization has occurred (Thompson) • The formation 
of DNA/DNA hybrids can be confirmed as above. 

15 

(b) Isolating Unique-Sequence Fragments 

Following hybridization, the reaction components are 
contacted with a solid support effective to preferentially 
bind non-unique fragments. In a preferred embodiment, where 

20 the negative-source strands are biotinylated, the reaction 
components are separated on an avidin or streptavidin column 
which is effective to bind all biotinylated strands and non — 
biotinylated (positive-source) strands which are hybridized 
therewith. Since the biotinylated strands are present in 

25 large molar excess, virtually all common-sequence strands from 
the positive -source will be bound to a complementary, biotiny- 
lated strand, and thus removed by the affinity chromatography. 

Methods for separating the hybridization mixture on a 
streptavidin column are detailed in Example 3. Briefly, the 

30 hybridization mixture is passed through a column packed with 
streptavidin-agarose and washed extensively with elution 
buffer. The eluates from several washes are pooled and the 
nucleic acid is concentrated by ethanol precipitation, yield- 
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ing a fragment mixture which is enriched for sequences which 
are unique to the positive-source material. 

According to an important feature of the invention, the 
fragments isolated will contain the end linkers originally 
5 attached to the posit iver-source fragments, and thus can be 
further amplified, by the amplification method described in 
Section C. Where the positive^source linker is different than 
that on the negative-source fragments, additional amplifica- 
tion (employing the "positive -source" primer) will selectively 

10 amplify positive-source fragments, providing further enrich- 
ment of sequences which are unique to the positive source. As 
indicated in Figure 4, the isolated amplified material may be 
rehybridized with the negative source fragments (which may be 
additionally amplified if necessary), and unique fragments 

15 again isolated by affinity chromatography, to enhance enrich- 
ment. 

The final isolated fragments can be treated with DNA 
polymerase to ensure complete strand replication in the duplex 
species, then treated with a selected linker-site endonucle- 
20 ase, to provide fragment ends suitable for cloning or the 
like. Alternatively, or in addition, the isolated fragments 
can be radiolabeled according to known procedures, for use as 
a DNA probe. 

(2) Isolation and Cloning of Viral Agent 

Figure 5 illustrates one application of the subtraction 
method, for isolation and cloning of a viral agent present as 
RNA transcript in an infected source, such as selected tissue 
or body fluid (e.g. bile). Here RNA mixtures isolated from 
infected (positive) and non-infected (negative) sources are 
used to produce cDNA duplex fragment mixtures, which are 
blunt-ended, ligated to a selected linker, and amplified as 
above. After biotinylating the negative-source fragments, the 
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two fragment mixtures are combined, denatured, reannealed, and 
separated by affinity chromatography (Section D-l) . The 
isolated fragments, which contain a selected restriction site 
in their end-linker regions, are cut at this site and cloned 
5 in a suitable expression vector, such as the lambda gtll 
vector. The resulting lambda gtll phage constitute a fragment 
library enriched for sequences unique to the positive source 
material . 

To select for desired isolates which produce a viral 
10 antigen, host cells are infected with library phage, and 
reacted with antiserum from an infected individual, to bind 
virus-related antibodies to host cells producing viral anti- 
gens. The cells are then washed to remove unbound antibody 
and contacted with a reporter-labeled anti-human IgG antibody, 
15 to label cells which have bound virus-specific antibodies. 
These cells are identified by the presence of the reporter. 
The vector (s) containing a viral insert can be used for con- 
tinued antigen production, or as a source viral sequence 
probe . 

20 Alternatively, the cloned isolated fragments in the 

fragment library can be identified and selected using DNA 
probes. By way of illustration, the fragment library genera- 
ted as above can be replica plated and hybridized with labeled 
cDNA probes from infected and non-infected sources. Probe 

25 hybridization can be carried out by conventional Southern 
blotting. Those library vectors which contain a viral-agent 
insert will hybridize with viral-agent cDNA probes present in 
the positive-source cDNA probes, but not with negative-source 
probes . 

30 



WO 91/05861 PCI7US90/05938 



23 

3. Identification and Isolation of Cellular Nucleic Acid 
Products 

Methods similar to the DNA subtraction and probe selec- 
tion method just described can be applied to identifying and 
5 isolating nucleic acid fragments which are unique to cells 
expressing oncogenes, or cells deficient in transcripts re- 
lated to genetic disease. 

Figure 6 illustrates how the amplification technique 
described in Section C can be used to confirm that the unique- 

10 sequence fragment clone (s) identified as above are in fact 
unique to the positive source. Here the two cDNA fragment 
mixtures prepared from positive and negative sources are 
amplified, as in Section C, and fractionated by gel electro- 
phoresis. The size fractionated gel fragments are then trans- 

15 ferred by conventional Southern blotting to filters and hybri- 
dized with the isolated cloned insert of interest, after probe 
labeling. If the probe sequence is unique to the positive 
source fragments, hybridization will be seen with amplified 
positive-source fragments only; as indicated in Figure 6, the 

20 region of probe binding is fairly broad, as would be expected 
since the amplified fragments will have a range of sizes. 
This method of screening amplified cDNA fragments is detailed 
in Example 4, where the method has been used to confirm that 
an isolated ET-NANB fragment insert was unique to cDNA from an 

25 infected bile source. 

4. Identifying cDNAs representing linked genes 

There is no general rule for the chromosomal distribu- 
tion of genes of like structure or function. However, gene 
30 duplication followed by divergence is a common mechanism for 
increasing the complexity and functional specialization of 
gene families. Although gene duplication could occur by 
transposition or retroposition, observations in a large number 
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of gene families suggest that duplications arose by tandemiza- 
tion of genes, perhaps as a result of unequal crossing over. 
Tandemization would create closely linked clusters of genes, 
which could then become separated by events such as chromosome 
5 breakage. However, even in cases where related gene are 
located on more than one chromosome, • as in the genes of the 
complement system (Leonard et al.) or the genes for serum 
lipoproteins (Aegerter-Shaw) , subsets of the gene system may 
occur as closely linked genes. 

10 There are a large number of cases in which related genes 

are linked in clusters containing few unrelated intervening 
genes. Examples include genes for the related peptides of 
certain pituitary and placental hormones (D'Eustachio et al.), 
cluster of immune response and class I major histocompatibi- 

15 lity genes (Barsh et al . ) , clusters of genes for some inter- 
ferons (Weiss et al . ) , genes for Ul and U2 RNAs (Erickson et 
al . ) , human metallothionein genes (Westin et al . ) , histone 
genes (Karin et al.), proline-rich salivary protein genes (Old 
et al . ) , genes for the constant regions of heavy or light 

20 chains of immunoglobins (Azen et al.), and alpha globin gene 
clusters (Tonegawa) . The two globin gene clusters are espe- 
cially relevant, because there are linked embryonic genes 
within these clusters which have important functions in early 
development (the zeta gene within the alpha cluster and the 

25 epsilon gene within the beta cluster) . These would not have 
been readily detected by cross hybridization with the adult 
gene, and therefore serve to illustrate the point that cross 
hybridization is not necessarily a good test of relatedness 
between clustered genes. Also relevant to this point are the 

30 problems encountered in attempting to isolate related lympho- 
kines using cross-species homologies. Thus, the mouse IL-3 and 
GM-CSF genes share only limited homologies with their human 
counterparts and point to the possibility that growth factor 
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genes may have diverged very rapidly after tandemization (Wong 
et al . , Yank et al . ) . 

In general, genes which have arisen by duplication of a 
common ancestor prior to or in the early phases of the mam- 
5 malian radiation have diverged to the extent that they cannot 
be readily detected by cross-hybridization of nucleic acid 
sequences. These observations indicate that, where divergent 
genes exist that arose by duplications of a common ancestor, 
moderate- to high- stringency hybridization approaches are 

10 probably not an adequate tool for their detection, whereas 
coding sequence comparisons between cDNAs may very well reveal 
functional similarities. 

The cDNA amplification method described herein makes 
possible a method to isolate cDNAs that map to specific large 

15 DNA fragments. Isolation of these cDNAs then allows limited 
cDNA sequence analysis to determine whether they might encode 
interesting gene products and also provides probes to survey 
the normal tissue distribution of their corresponding tran- 
scripts. One application of this method has been directed to 

20 the isolation of new growth controlling genes. 

The isolation of new growth-controlling genes would have 
considerable significance in investigating the normal physio- 
logical function of growth factors and in the production of 
sufficient quantities of material for the assessment of poten- 

25 tial therapeutic applications. Figure 7 shows the regional 
assignments of potential growth factor/receptor genes within a 
subregion of chromosome 5. Five of these loci encode CSFs/in- 
terleukins. CSFs are a family of glycoproteins that are 
believed to be necessary for the growth and maturation of 

30 myeloid progenitor cells (Clark et al.) . GM-CSF has been 
localized to 5q23-31 (Huebner et al., LeBeau et al., 1986). 
IL-3 (multi-CSF) is located within 5q23-31 (LeBeau et al., 
1987) . Initially, cDNA isolation efforts have been focused 
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predominately on the 5q23.3-q32 region of human chromosome 5 
which contains eosinophil colony stimulating factor, also 
called T-cell replacing factor, (interleukin 5: IL-5) and the 
B-cell growth factor (IL-4) (Le Beau et 'al, VanLeeuwen et al) . 
5 A variety of sources can be used for the genomic DNA. A 

standard source is, of course, genomic DNA isolated from a 
tissue or a cell line. Hybrid cell lines also provide a con- 
venient source of genomic starting material. Large DNAs of 
greater than 50 kb can be separated by a number of techniques, 

10 including pulse field gel electrophoresis (Carle et al., Smith 
et al., Chu et al.), to provide an appropriate starting mate- 
rial which has been enriched for the target region. Another 
convenient source of genomic starting material can be genomic 
DNA banks created in a cloning vector. There exist a number 

15 of vectors, known to one of ordinary skill in the art, which 
are useful for cloning large pieces of DNA:. such useful 
cloning vectors include modifications of phage lambda and 
yeast artificial chromosomes. Lambda vectors (eg., Proto- 
clone™ lambda gtlO, Promega) have been developed for creating 

20 genomic libraries which can tolerate relatively large inser- 
tions: for example, lambda gtlO can contain up to 7.6 kb 
inserts. Yeast artificial chromosomes (YAC) are in fact a 
preferred vector system since they provide the convenience of 
maintenance in a unicellular organism and have a capacity for 

25 very large inserts: up to several hundred kilobases (Burke et 
al.). In fact, the efficacy of the YAC system in the con- 
struction of a human genomic DNA bank has been demonstrated 

(Burke et al.) • 

A variety of methods to identify genomic DNA fragments 
30 of interest exists, such as hybridization with a labelled 
probe, but in many cases, are extremely limited by the availab- 
ility of sufficient quantities of target sequences for iden- 
tification and subsequent molecular manipulations. The ampli- 
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f ication method described in Section C is of great value in 
this regard since it provides a method to amplify a group of 
DNA molecules independent of prior knowledge of their nucleic 
acid sequence; an application of this method is presented in 
5 Example 5. 

The overall scheme for purification of a genomic DNA 
fragment derived from the 5q23-31 region of human chromosome 5 
is shown in Figure 8 and described in detail in Example 5. 
The source of starting DNA was the hamster/human hybrid cell 

10 line HHW105 (Overhauser et al.); this hybrid contains chromo- 
some 5 as its only human material* Accordingly, use of this 
hybrid cell line provided an immediate enrichment of human 
chromosome 5 sequences. Initial pulse-field gel electrophore- 
sis (Burke et al.) mapping using an IL-5 specific probe showed 

15 a Not I IL-5-containing fragment of 500 kilobases (kb) which 
was reduced by about 50 kb upon double-digestion with Nrul; 
this fragment was chosen as the genomic DNA target fragment. 

The HHW105 cell hybrid genomic DNA was cut with Not I ; 
the resulting fragments were resolved on a CHEF gel (Smith et 

20 al., Chu et al.). Since CHEF gel tracks are straight rather 
than curved this gel system allowed the isolation of discrete 
size DNA fractions by horizontally slicing the gel. The IL-5 
containing fraction was identified (page 9) by taking a small 
sample of each gel slice and subjecting the sample to the 

25 polymerase chain reaction assay (Scharf et al.) using IL-5 
specific primers. 

Our modifications of the polymerase chain reaction 
conditions make it unnecessary to purify the DNA away from the 
agarose in the gel slice sample, unlike previous methods which 

30 employ a phenol /chloroform extraction step to eliminate the 
agarose. The ability to eliminate the phenol/chloroform 
extraction step greatly facilitates performing the number of 
assays necessary to identify the genomic fragment of interest. 
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An approximation of the degree of purification that this 
method of genomic fragment isolation affords can be made by 
either estimating the amount of ethidium bromide fluorescence 
in the purified band vs. total, or by directly measuring the 
5 amount of DNA present in the size fraction by spectrofluoro- 
metry. Human chromosome 5 constitutes- about 5%, by length, of 
the human genome and is present at one copy per genome in the 
diploid HHW1 05 thus constituting about 2.5% of all DNA present 
in the hybrid genome (Wasmuth personal communication) . The 

10 following calculations can be made to assess the degree of 
purification required to isolate one -specific large fragment 
of human DNA. The diploid genome size of human and hamster is 
6 X 10 9 base pairs (bp). Therefore, in HHW105 about 1.5 X 10° 
bp of DNA will be contained in chromosome 5 (2.5% of 6 X 10 9 

15 bp) . Assuming an average NotI fragment length of approximately 
500 kb, chromosome 5 would contain. 300 Not I fragments; accord- 
ingly, about a 300-fold purification would be required to 
isolate only one of these fragments. An 80-fold purification 
is readily obtained on one pulsed-field gel (Michiel et al) 

20 and two gel purification steps can raise this to at least 300- 
fold. 

The IL-5-containing NotI genomic fragment was cut with 
Nrul and this sample resolved on a second CHEF gel. The gel 
was sliced as above and an aliquot of each gel slice subjected 

25 to polymerase chain reaction amplification with IL-5 specific 
primers. These amplified samples were then size fractionated 
on a conventional agarose gel and the DNA molecules trans- 
ferred to nitrocellulose. In this case the IL-5-containing 
fraction was identified by probing the nitrocellulose paper 

30 with a radiolabeled IL-5 specific probe (Figure 10) . 

The lL-5-containing NotI /Nrul fragment and contaminating 
hamster fragments were further size fractionated, the A lin- 
kers (see Section B) ligated to the molecules, and the DNA 
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fragments amplified by the method described in Section C 
These amplified fragments were then size fractionated on a 
conventional agarose gel (Figure 11) and transferred to a 
nitrocellulose filter, a variety of solid supports are avail- 
5 able which will perform the same function as the nitrocellu- 
lose filter such as GeneScreen™ (DuPont-New England Nuclear) 
or the solid support described above in section 2A(ii) . 

Example 7 describes the selection of cDNA molecules 
who.ch are homologous to the genomic regions isolated in Ex- 
10 ample 5. The cDNA molecules were selected from a T-cell cDNA 
library (Example 6) . cDNA molecules can be amplified before 
and/or after selection. For example, vector sequences flank- 
ing the 5' and 3' end of the cDNA inserts can be used as end- 
terminal primers to amplify the cDNA inserts before any selec- 
15 tion. The 5' ends of the primers can also contain sequences 
encoding restriction sites which will subsequently simplify 
insertion of the cDNA molecules into cloning/expression vec- 
tors. 

Although a cDNA library was used in the 1L-5 example 
20 (Example 7) , the initial nucleic acid need not be a cDNA. It 
could, for example, be an RNA molecule where, after the ini- 
tial selection by hybridization from a population of molec- 
ules, a first strand of homologous DNA is synthesized by a 
known procedure, such as random priming (Boehringer-Mannheim 
kit) or oligo-dT priming (Maniatis et al.), and second strand 
synthesis can be primed in the classic hairpin-primed fashion 
(Maniatis et al . ) . 

Extensive washing of the filter, after hybridization, 
using relatively stringent conditions was performed to mini- 
mize non-specific binding. since different linkers were used 
for amplification of the selected cDNAs versus the genomic 
sequences (the C and A primers, respectively, described in 



25 



30 



WO 91/05861 



PCT/US90/05938 



30 

Section B) , any genomic sequences which come off the filter 
during the elution steps will not be amplified. 

This selection technique is rapid and, to some extent, 
serves to equalize the abundance of selected cDNAs. This 
5 equalization occurs because the copy number of the genomic 
exon sequences on the filter should- be roughly equivalent, 
despite differing abundances of the cDNAs homologous to each 
to these genomic sequences. Thus, when the cDNAs are in 
excess, a normalization should occur relative to the genomic 
10 sequences. 

In the case of IL-5, Figure 12 shows the ethidium bro- 
mide staining of an agarose gel on which the amplified cDNA 
molecules, which were selected by the above described method, 
were resolved. Further, Figure 13 shows the hybridization of 

15 various probes with the amplified cDNAs. As can be seen from 
the figure, IL-5 specific sequences are present as expected. 
Further, a smear of hybridization corresponding to cDNA se- 
quences homologous to total human DNA is also present suggest- 
ing a variety of cDNA molecules other than IL-5 have been 

20 isolated by this method. 

The method applied to the isolation of cDNAs homologous 
to the IL-5 containing region of chromosome 5 has also been 
applied to the IL-3 (Figure 7) containing region of chromosome 
5. As can be seen from Figure 14, the selection method was 

25 also efficacious for the isolation of cDNAs corresponding to 
the IL-3 region of chromosome 5. Using the genomic DNA clones 
we obtained from the IL-3 region we constructed a long-range 
restriction map of this region. Using a probe specific for GM- 
CSF we physically mapped GM-CSF to be within 10 kilobases of 

30 IL-3 (Figure 15) . When cDNAs, selected on the basis of their 
hybridization to the IL-3 region genomic DNA probes, were 
tested for their hybridization to a GM-CSF specific probe a 
positive signal was obtained. This result further demonstrates 
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the ability of the instant method to identify cDNAs present in 
the same chromosomal region as a selected gene of interest 
(such as IL-3) . 

One important consideration in. the above selection 
5 technique is the quenching of genomic repetitive sequences 
within the various cloned DNAs. Quenching can be achieved by 
one or both of the following treatments: 1) pre-hybridization 
of the filter; and 2) pre-hybridization of the cDNA library in 
solution before the filter selection step. Both of these 
10 approaches tie up the repetitive sequences and prevent them 
from acting as a homologous hybridization site in the selec- 
tion step (e.g. the hybridization of the filter with the cDNA 
library) . Effective quenching can be achieved in the follow- 
ing ways: 

15 0ne approach to quenching is pre-hybridization to an 

intermediate C 0 t value (Litt et al., Sealey et al.) with total 
randomly sheared human genomic DNA. 

Another approach to quenching genomic repeats is to use 
a chromosome-specific genomic DNA library from a chromosome 
20 other than the target chromosome for pre-hybridization: in the 
case of IL-5, a human chromosome other than human chromosome 
. 5. 

Finally, the most preferred approach to quenching geno- 
mic repeats is to use a library of isolated repetitive sequen- 

25 ces representative of various abundance classes of repeat 
sequences. This set of repeats is isolated by screening a 
human genomic DNA library with total radiolabeled human DNA at 
a low plaque density. A mixture of plaques which show low, 
intermediate, and high density signals are picked and these 

30 sets of sequences can be used as the blocking reagent for pre- 
hybridization. Alternatively, such a library can be generated 
by shearing total human genomic DNA into short lengths, dena- 
turing and reannealing to an intermediate C„t value and clon- 
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ing the repaired double-stranded molecules. In general, an 
initial blocking of the filter with sonicated herring sperm 
DNA is followed by pre-hybridization blocking using the repeat 
library . 

5 To improve the representation of cDNAs corresponding to 

a given genomic region, a further step of hybridization selec- 
tion is performed employing, for example: (1> a different 
starting combination of restriction enzymes used to generate 
the partially purified genomic size fraction; and/or (2) a 

10 different source of the genomic DNA region of interest (eg. a 
yeast artificial chromosome clone that contains the appropri- 
ate genomic region) . The set of eluted cDNAs, from above, 
would be hybridized to these sequences. In this way the chan- 
ces of re-electing contaminating cDNAs would be minimized and 

15 the chances of isolating only cDNAs from a specific region 
would be maximized. 

Once cDNA molecules homologous to a selected chromosomal 
region have been isolated, the cDNAs are cloned (Protoclone™ 
lambda gtlO) and plated at low plaque density. The phage DNAs 

20 are transferred to filters and hybridized (Woo) with a label- 
led repetitive sequence library or with total radiolabeled 
human DNA to distinguish between clones which contain repeat 
sequences from clones which contain unique sequences. DNA is 
then isolated from a representative sample of the unique 

25 sequence clones (500 clones in the case of IL-5) and the cDNA 
inserts isolated by EcoRI digestion. Aliquots of the cDNA 
inserts are spotted to nitrocellulose filters. Then the cDNA 
inserts are nick-translated and used to probe the nitrocellu- 
lose filters to identify duplicate clones (Southern) ; dupli- 

30 cates are then discarded. 

The clones are then mapped to chromosome 5 using a mini- 
panel of the hybrid HHW105 DNA, hamster DNA, and human DNA. A 
large collection of various digest of these DNAs as Southern 
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blots have been prepared on nylon filters. Clones that map to 
chromosome 5 and are unique can be positioned on the map of 
the genomic DNA surrounding the IL-5 gene utilizing hybridiza- 
tion back to the large DNA fragments that were originally 
5 resolved by CHEF gel electrophoresis. Clones which are found 
to contain repeats will be pre-quenched or subcloned prior to 
further analysis. 

This method provides a novel and efficient way to iden- 
tify cDNAs corresponding to a contiguous region of genomic DNA 
10 when conventional probes are unavailable, and also the method 
provides a means to identify previously unknown gene products 
of potential research and therapeutic value. 

5. Identifying Homologous Viral Nucleic Acid Sequences 

15 The methods of the instant invention are useful for the 

analysis of viral genome structure and function. Complete DNA 
copies of viral genomes can be used as probes or, alternately, 
selected fragments of the genome can be utilized* In the case 
of RNA viruses DNA copies of the genome can be synthesized, as 

20 by reverse transcriptase, or in some cases intermediate DNA 
forms of the virus may be isolated (Kashmiri et al . ) . Im- 
mobilized viral genomic DNA can be used to screen the nucleic 
acids of infected cells: for example, cDNA libraries may be 
made from the RNAs of infected cells and these cDNAs isolated 

25 on the basis of their sequence homology to the viral genomic 
DNA probe. The ability to amplify cDNA molecules independent 
of their specific sequence, that is, by use of the linkers 
(Figure 2) of the instant invention (e.g. Example 6), is 
helpful in the identification of normally low abundance mes- 

30 sages. Further, the ability to amplify the cDNAs eluted from 
the viral genomic DNA probe provides another increase in the 
capability of the method to identify cDNAs having infrequent 
hybridizations. 
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The isolated cDNAs can then be mapped to the viral 
genome to assign coding regions. This method can also be used 
to isolate cellular proteins which are related to the viral 
genomic sequences. 
5 The methods of the instant invention can also be applied 

to the analysis of the nucleic acid sequences of heterologous 
viruses. For example, a related virus can be used to help 
characterized a new virus and identify related coding sequen- 
ces. This approach has been used to examine relationships 

10 between the bovine leukosis virus (BLV) and a newly isolated 
human-lymphoma associated virus (HuLAV, co-pending U.S. ap- 
plication 361,855, herein incorporated by reference). The 
structural proteins of HuLAV viruses demonstrate a high degree 
of homology to other known retroviruses; in particular, the 

15 seroreactivity of individuals seropositive for the 10C9 HuLAV, 
(the HuLAV carried in cell line 10C9) with BLV envelope pro- 
tein suggests that the putative envelope protein of HuLAV 10C9 
may share immunogenic epitopes with BLV. 

A cDNA library prepared from HuLAV 10C9 particles (E- 

20 xample 9-B) was hybridized against a BLV genomic DNA probe 
bound to a nitrocellulose filter, the filter washed, the 
selected cDNAs eluted and cloned into lambda gtll. No cDNA 
molecules were selected which gave a positive signal when the 
lambda gtll clones were immunologically screened with anti-BLV 

25 sera or anti-gp62 sera (gp62 is an envelope protein of another 
related retrovirus, HTLV-I; Kiyokawa et al) . 

. However, HuLAV 10C9 cDNAs isolated by the methods of the 
instant invention, as described in Examples 9-10, yielded the 
following results: hundreds of lambda gtll cDNA clones posi- 

30 tive with anti-BLV; 15 clones positive with anti-p24 (p24 is 
an approximately 27,000 molecular weight HuLAV core protein) 
of which 4/15 cross-reacted with anti-BLV; and 4 positive 
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clones detected with anti-gp62 of which all four cross-reacted 
with anti-BLV. 

Accordingly, use of the instant methods applied to the 
isolation of homologous nucleic acid sequences between the 
5 related but distinct viruses BLV and HuLAV resulted in abi- 
lity to detect homologous nucleic acid sequences where stan- 
dard technology had failed. This increased ability was evi- 
denced by the immunological reactivities of the lambda gtll 
expression products of the cloned selected cDNAs. Further, 
10 when the selected cDNAs were cloned in lambda gtlO and then 
screened with full length BLV probes, 76 positives were detec- 
ted out of 2 X 10 4 clones screened. These results illustrate 
the increased sensitivity which can be achieved by employing 
the methods of the instant invention. 

15 

6. Enrichment of Selected Genomic DNA Fragments 

One difficulty in mammalian genetics is having adequate 
amounts of starting material available for analysis. The pre- 
sent amplification method provides' a straight-forward method 

20 to generate a library which is highly enriched in sequences 
derived from a selected genomic region independent of dif- 
ferent sequence species. Genomic DNA fragments may be isolated 
from a contiguous genomic DNA section, eg. by size fractiona- 
tion on a gel, fragmented to form fragments of less than about 

25 2 kilobases in size, the linkers of the instant invention 
ligated onto the fragments, and the fragments amplified. As 
illustrated above for the IL-5 region of chromosomes, genomic 
sequences can be isolated for a region of interest. These 
genomic sequences can then be amplified independently of the 

30 specific fragment sequence using the linkers of the instant 
invention (Example 5-E) and then subcloned into an appropriate 
vector (eg. lambda gtlO, Protoclone™, Promega) . These cloned 
genomic sequences are then available for a variety of manipu- 
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lations such as use as probes (Examples 7-8, 9-10), especially 
in genetic linkage analysis, and the generation of long-range 
restriction maps. Such long-range restriction maps are con- 
structed by restriction digestion of the various clones, size 
5 fractionation of their fragments, construction of restriction 
maps for each clone, and finally lining up overlapping regions 
between clones. Libraries generated in this way provide 
excellent representation of the genomic region of interest. 
7. Identification of Disease-related Loci 

10 The molecular identification and analysis of genes 

corresponding to disease-related loci ■ has been a major chal- 
lenge of molecular genetics. Genetic linkage data exists for 
a number of genetic diseases (e.g. familial schizophrenia 
(Sherrington et al.) and in some cases physical maps of the 

15 specific chromosomal regions are available (e.g. the cystic 
fibrosis locus has been mapped to an approximately 500 kb 
region (Poustka et al.)), growth factors involved in certain 
neoplasms such as the familial polyposis locus isolated to a 
small region of human chromosome 4 (Bodmer et al.). The 

20 method of the invention may be useful to aid in the identifi- 
cation of specific disease-related gene products. Figure 16 
provides a general outline of approaches which may be taken 
leading to the identification of specific disease-related loci 
and their gene products. 

25 Using the techniques outlined above for the Isolation of 

genomic DNA fragments, long range restriction maps can be 
constructed of the genomic DNA region corresponding to a 
disease-related locus. A number of potential starting mate- 
rials containing the genomic DNA of interest are available as 

30 discussed in section II-B, such as, yeast artificial chromo- 
somes, hybrid cell lines, and DNA fragments obtained by 
chromosome jumping (Collins et al.). Then, by application of 
the techniques used for the isolation of the genomic DNA in 
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the region of IL-5, known genetic markers for the regions of 
disease related loci can likewise be used to isolate genomic 
DNAs from these regions and then these selected genomic DNAs 
can be used as probes to select corresponding cDNA molecules 
5 from normal and disease-state samples. 

Candidate cDNAs from the region- of interest can then be 
isolated and compared by a variety of methods to determine 
differences between normal and disease state samples. One 
approach is the use of the subtraction method described above 
10 to detect cDNAs which are present in one sample, but not 
present in another. Another approach is to directly select a 
small set of cDNAs using the technology described above. To 
identify specific mutations in the isolated cDNAs a number of 
systems can be used including the following: (1) the use of 
15 modified denaturing gradient polyacrylamide gels to identify 
single base pair alterations, and (2) the use of incorporated 
biotinylated nucleotides to mark the positions of mutations 
determined by comparison to wild type DNAs. Also, partial DNA 
sequence analysis can be used to facilitate matching and 
20 comparisons of candidate cDNAs obtained from different sam- 
ples, as well as to eliminate candidates based on their homol- 
ogy to known sequences present in sequence data bases (e.g. 
N.I.H. GeneBank) . 

The following examples illustrate the method of fragment 
25 amplification and isolation, and of the cDNA isolations de- 
scribed above, but are in no way intended to limit the scope 
of the method or its application. 

Example 1 

30 General Procedures and Materials 

Plasmid piAN13 was obtained from Dr. Brian Seed (MIT) . 
E - coli strain KM392, a suppressor positive strain, was ob- 
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tained from Dr. Kevin Moore, DNAX, Palo Alto, CA. E. coll 
strain LG75, a suppressor minus strain, was obtained from Dr. 
Brian Seed (MIT) . Lambda phage Ch21a was obtained from Dr. 
Frederick Blattner (University of Wisconsin) , and a Not I site 
5 was inserted between the EcoRI and Hlndlll sites (Ch21aLJ) , by 
replacing the 2 kb EcoRI/ Hindi I I fragment with an oligonu- 
cleotide containing a NotI site. E. coli strain MC1061 har- 
boring the P3 plasmid was obtained from Dr. Brian Seed (MIT) . 

Terminal transferase (calf thymus) , alkaline phosphatase 

10 (calf intestine) , polynucleotide kinase, E. coli DNA poly- 
merase I (Klenow fragment) , and SI nuclease were obtained from 
Boehringer Mannheim Biochemicals (Indianapolis, IN); phiX174 
fragments, produced by digesting the phiX174 phage to comple- 
tion with Haelll, were obtained from BRL Laboratories (Bethe- 

15 sda, MD) . 

Sma l, EcoR I, NotI , T4 DNA ligase and T4 DNA polymerase 
were obtained from New England Biolabs (Beverly, MA) ; and 
streptavidin agarose, from Bethesda Research Labs (Bethesda, 
MD) . Low-gelling temperature agarose (Sea Plaque) was ob- 

20 tained from FMC (Rockland, ME) . Nitrocellulose filters were 
obtained from Schleicher and Schuell. 

Synthetic oligonucleotide linkers and primers were 
prepared using commercially available automated oligonucleo- 
tide synthesizers. Alternatively, custom designed synthetic 

25 oligonucleotides may be purchased, for example, from Synthetic 
Genetics (San Diego, CA) . cDNA synthesis kit and random prim- 
ing labeling kits were obtained from Boehringer-Mannheim 
Biochemical (BMB, Indianapolis, IN) . 

Kinasing of single strands prior to annealing or for 

30 labeling is achieved using an excess, e.g., approximately 10 
units of polynucleotide kinase to 1 nmole substrate in the 
presence of 50 mM Tris, pH 7.6, 10 mM MgCl 2 , 5 mM dithioth- 
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reitol, 1-2 mM ATP, 1.7 pmoles gamma 32 P-ATP (2.9 mCi/mmole) , 
0.1 mM spermidine, 0.1 mM EDTA. 

Site-specific DNA cleavage is performed by treating with 
the suitable restriction enzyme (or enzymes) under conditions 
5 which are generally understood in the art, and the particulars 
of which are specified by the manufacturer of these commer- 
cially available restriction enzymes (see, e.g., New England 
Biolabs, Product Catalog) . In general, about 1 |il of plasmid 
or DNA sequence is cleaved by one unit of enzyme in about 20 
10 \il of buffer solution after 1 hr digestion at 37°C; in the 
examples herein, typically, an excess of restriction enzyme is 
used to insure complete digestion of the DNA substrate. 

Incubation times of about one hour to two hours at about 
37°C are workable, although variations can be easily tolera- 
15 ted. After each incubation, protein is removed by extraction 
with phenol/chlorof orm, and may be followed by. ether extrac- 
tion, and the nucleic acid recovered from aqueous fractions by 
precipitation with ethanol (70%) . if desired, size separation 
of the cleaved fragments may be performed by polyacrylamide 
20 gel or agarose gel electrophoresis using standard techniques. 
A general description of size separations is found in Methods 
in Enzymolocrv (1980) 65:499-560. 

Restriction cleaved fragments may be blunt-ended by 
treating with the large fragment of E. coli DNA polymerase I 
25 (Klenow reagent) in the presence of the four deoxynucleotide 
triphosphates (dNTPs) using incubation times of about 15 to 25 
min at 20°C to 25°C in 50 mM Tris pH 7,6, 50 mM NaCl, 6 mM 
MgClj 6 mM DTT and 0.1-1.0 mM dNTPs. The Klenow fragment 
fills in at 5' single-stranded overhangs in the presence of 
30 the four nucleotides. if desired, selective repair can be 
performed by supplying only one of the, or selected, dNTPs 
within the limitations dictated by the nature of the overhang. 
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After treatment with Klenow reagent, the mixture is extracted 
with phenol/chloroform and ethanol precipitated. Treatment 
under appropriate conditions with Si nuclease results in 
hydrolysis of any single-stranded portions of DNA. In par- 
5 ticular, the nicking of 5' hairpins formed on synthesis of 
cDNA is achieved. 

Ligations are performed in 15-50 \il volumes under the 
following standard conditions and temperatures: for example, 
20 mM Tris-Cl pH 7.5, 10 mM MgCl 2 , 10 mM DTT, 33 mg/ml BSA, 10 

10 mM-50 mM NaCI, and either 40 mM ATP, 0.01-0.02 (Weiss) units 
T4 DNA ligase at 14°C (for "sticky end" ligation) or 1 mM ATP, 
0.3-0.6 (Weiss) units T4 DNA ligase at 14°C (for "blunt end" 
ligation). Intermolecular "sticky end" ligations are usually 
performed at 33-100 mg/ ml total DNA concentrations (5-100 nM 

15 total end concentration) . Intermolecular blunt end ligations 
are performed at 1 mM total ends concentration. 

In vector constructions employing "vector fragments", 
the vector fragment is commonly treated with bacterial alka- 
line phosphatase (BAP) or calf intestinal alkaline phosphatase 

20 (CIP) in order to remove the 5' phosphate and prevent self-li- 
gation of the vector. Digestions are conducted at pH 8 in 
approximately 10 mM TrisHCl, 1 mM EDTA using about 1 unit per 
mg of BAP at 60°C for one hour or 1 unit or CIP per mg of 
vector at 37°C for about one hour. In order to recover the 

25 nucleic acid fragments, the preparation is extracted with 
phenol/ chloroform and ethanol precipitated. Alternatively, 
religation can be prevented in vectors which have been double 
digested by additional restriction enzyme digestion and separ- 
ation of the unwanted fragments. 

30 
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Example 2 
Amplifying Duplex DNA Fragments 
A. Modification and Amplification of piAN13 plasmid 

Plasmid piAN13 carries a supF suppressor gene capable of 
5 suppressing an amber mutation. The plasmid was linearized by 
digestion with Sma l and ligated to the linker whose sequence 
is shown at A in Figure 2, at a 1:100 molar ratio in the 
presence of 0.3-0.6 Weiss units of T4 DNA ligase. The linker 
sticky ends Were filled in with Klenow fragment, convent ional- 

10 ly, and the mixture was treated with Nrul to cleave linker 
dimers. Smal linkers were attached to the blunt-ends of the 
plasmid fragments under ligation conditions like those above. 
After treatment with Sma l to remove redundant Sma l linkers, 
the fragments were recircularized. This was done at a frag- 

15 ment concentration of about 1 jig/ml, in the presence of 4,000 
units/ml DNA ligase, at about 14°C for 18 hours. The result- 
ing plasmids were thus modified to contain an A-sequence 
linker on either side of the original Sma l site. 

E » coli strain MC1061 harboring the P3 plasmid was 

20 transformed with the recircularized plasmid, and selection for 
successful plasmids was made on the basis of bacterial growth 
in the presence of kanamycin, ampicillin, and tetracycline. 
Plasmids were isolated from the resistant bacterial colonies, 
and cut with Smal to produce blunt-end, linearized fragments 

25 having the desired A-sequence end linkers. 

To 100 ul of 10 mM Tris-Cl buffer, pH 8.3, containing 
1.5 mM MgCl 2 (Buffer A) was added 1 x 10-3 jig of the linearized 
plasmid, 2 uM of a primer having the sequence 
d(5'-GGAATTCGCGGCCGCTCG-3') , 200 uM each of dATP, dCTP, dGTP, 

30 and dTTP, and 5 units of Thermus aguaticus DNA polymerase (Taq 
polymerase) . The reaction mixture was heated to 94°C for 1 
minute for denaturation, allowed to. cool to 50°C for 2 min 
for primer annealing, and then heated to 72°C for 5-12 min to 
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allow for primer extension by Taq polymerase. The replication 
reaction, involving successive heating, cooling, and polyme- 
rase reaction, was repeated an additional 25 times with the 
aid of a Perkin-Elmer Cetus DNA thermal cycler. 

5 

B. Amplification of phiX174 Fragments 

To a 50 ill solution containing 100 ng of the Hae lll 
blunt-end fragments was added 300 ng of the linker whose 
sequence is shown at C in Figure 2, with ligation of the 
10 linker in the presence of DNA ligase being carried out as 
above. The mixture was treated with Nrul to cleave linker 
dimers . 

To 100 jil of Buffer A was added 1 ng of the 
phiX174/HaeIII fragments, 2 pM of a primer having the sequence 

15 d(5' -GACGCGTGAATTCTCG-3' ) , 200 \M each of dATP, dCTP, dGTP, 
and dTTP and 5 units of Taq DNA polymerase. The reaction 
mixture was heated to 94°C for 1 minute for denaturation, 
allowed to cool to 50°C for 2 min for primer annealing, and 
then heated to 72°C for 5-12 min to allow for primer extension 

20 by Taq polymerase. The replication reaction steps were repea- 
ted 25 times, as above. The amplified fragments are referred 
to below as phiX/linker-C fragments. 

A second preparation of phiX174 fragments was amplified 
by similar methods, but using the sequence-A linker employed 

25 in the piAN13 amplification. This preparation is referred to 
below as phiX/linker-A fragments. 

Example 3 
Selection for piAN13 Sequences 
30 A. Biotinylation of Amplified phiX174 Fragments 

The amplified phiX174/linker-C fragment produced in 
Example 2-B are treated with EcoRI to yield 5' protruding ends 
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having an AATT sequence overhang. The fragments are biotiny- 
lated by Klenow fill-in reaction according to known procedures 
(Maniatis, pp. 113). Briefly, the reaction mixture consists 
of 100 jig/ml fragments in 100 mM potassium phosphate (pH 7.2), 
5 2 mM C0C1 2 , 0.2 mM DTT, 40 jiM Bio-7-dATP, 100 pM dTTP, and 100 
units/ml Klenow fragment. After incubation at 25°C for 45 
minutes, the reaction is terminated by heating at 65°C for 10 
minutes . 



10 B. Hybridization Reaction 

Amplified piAN13 vector is mixed at various mole ratios 
with the phiX/Linker-A fragments produced in Example 2-B. 
Mixtures contained weight ratios of 1:100, 1:30 and 1:3 piAN13 
to phiX/Linker-A fragments are prepared. Each mixture is then 

15 mixed with a 10-fold molar excess of biotinylated 
phiX/Linker-C fragments prepared as above, in an annealing 
buffer containing 2M guanidinium isothiocyanate (pH 5.8). The 
mixtures are heated to 65°C for 5 minutes, then allowed to 
cool to 25°C and hybridized for a 12-hour period. 

20 

C. Separation by Affinity Chromatography 

A 1 ml siliconized syringe plugged with siliconized 
glass wool is packed with 0.5 ml streptavidin-agarose and 
washed with lOmM Tris-Cl, 0.5 M NaCl, pH 7.0, containing 1 mM 

25 EDTA (elution buffer) . A portion of each of the three hybri- 
dization mixtures from above is EtOH precipitated (70%) , 
resuspended in 10 mM Tris HC1, 1 mM EDTA pH 7.0, and then 
loaded onto the column which is washed with several volumes of 
column elution buffer. 

30 For each mixture, the eluted fractions from the strep- 

tavidin column are combined and the DNA concentrated by etha- 
nol precipitation. The DNA is resuspended in Buffer A to a 
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concentration of 0*01 jig/ml, and mixed with 2 fiM of the above 
primer having the sequence d (5' -GGAATTCCGGCCGCTCG-3' ) , and 200 
HM each of dATP, dCTP, dGTP, and dTTP, and 5 units Taq poly- 
merase . The reaction mixture is denatured at 94°C for 1 
5 minute, cooled to 50°C for primer annealing, and primer exten- 
sion performed at 72°C for 5-12 minutes. 

The fragments from the each of the three eluted mixtures 
above are cut with Not I to produce Not I sticky ends in the 
linker regions of the fragments , as can be appreciated from 

10 Figure 2A. The fragments (13 ^tg/ml) are mixed with 60 fig/ml 
Not l-digested phage Ch21aLJ, and inserted at the Not I site in 
the phage in the presence of T4 DNA ligase. After encapsula- 
tion, the phage are used to infect E. coli strain KM392, a 
suppressor-positive strain, and E. coli strain LG75, a sup- 

15 pressor-minus strain. The number of plaques produced in the 
KM392 strain provides a measure of total number. of recombinant 
phage, since this strain is able to suppress the phage amber 
mutation, and thus allow phage expression of its A and B 
genes. In the LG75 strain, by contrast, only phage which have 

20 incorporated the piAN13 fragment produce plaques, since the 
sup F gene is required for growth of phage in suppressor minus 
bacterial strains. The ratio of plaques on the two host 
strains thus indicates the ratio of piAN13 fragments in the 
fragment mixtures. As a control, the ratio of fragments in 

25 the three mixtures before hybridization and streptavidin 
separation is also determined. 

Example 4 

Screening Amplified cDNA Fragment Mixtures 
30 A» Bile-derived cDNAs 

Two cynomolgus monkeys were intravenously injected with 
a 10% suspension of stool positive for ET-NANB obtained from a 



WO 91/05861 



PCT/US90/05938 



45 

human volunteer. It was determined the stool was positive for 
ET-NANB by binding of viruslike particles (VLPs) in the stool 
to immune serum from a known ET-NANB patient. The monkeys 
developed elevated levels of alanine aminotransferase (ALT) 
5 between 24-36 days after inoculation, and excreted 27-30 nm 
VLPs in their stools in the pre-acute phase of infection. The 
animals became seropositive for VLPs in the inoculum. 

The bile duct of each infected animal was cannulated and 
about 1 cc of bile was collected. RNA was extracted from the 
10 bile by hot phenol extraction, using a standard RNA isolation 
procedure. Double-strand cDNA was formed from the isolated 
RNA by a random primer for first-strand generation, using a 
cDNA synthesis kit obtained from Boehringer-Mannheim (Indiana- 
polis, IN) . 

15 A cDNA fragment mixture from non-infected cynomolgus 

monkey bile was prepared similarly, 

B. Stool-Derived cDNAs 

cDNA fragments obtained from human stool samples were 
prepared as follows: a 10% suspension of stool samples from 
healthy or ET-NANB-infected individuals was layered over a 30% 
sucrose density gradient cushion, and centrifuged at 25 kg for 
6 hr in an SW27 rotor, at 15°C. The pelleted material con- 
tained 27-32 nm VLP particles characteristic of ET-NANB infec- 
tion in the infected-stool sample. RNA was isolated from the 
sucrose gradient pellets in both the infected and noninfected 
samples, and the isolated RNA was used to produce cDNA frag- 
ments as described in Example 2. 

30 C. Amplification of the cDNA Fragment Mixtures 

The cDNA fragment mixtures from infected and non- in- 
fected bile source, and from infected and noninfected human- 
stool source were each amplified by linker/ primer replica- 
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tion, according to the method of the invention. Brief ly, the 
fragments in each sample were blunt-ended with DNA Pol I then 
extracted with phenol /chloroform and precipitated with etha- 
nol. The blunt-ended material was ligated with linker A shown 
5 in Figure 2, under conditions like those described in Example 
2, and the mixtures were digested with Nrul to remove linker 
dimers. Each mixture was brought to a final concentration of 
0.01 fig/ml in Buffer A, and to this was added 2 (iM of a primer 
having the sequence d (5' -GGAATTCGCGGCCGCTCG-3' ) , and 200 \M 

10 each of dATP, dCTP, dGTP, and dTTP. The reaction mixture was 
heated to 94°C for 1 minute for denaturation and allowed to 
cool to 50°C for 2 min for primer annealing, and then heated 
to 72°C for 5-12 min to allow for primer extension by Taq 
polymerase. The replication reaction, involving successive 

15 heating, cooling, and polymerase reaction, was repeated an 
additional 25 times with the aid of a Perkin Elmer Cetus DNA 
thermal cycler. 

D. Screening Amplified cDNA Mixtures 

20 The four amplified cDNA fragment mixtures from above 

were fractionated by agarose gel electrophoresis, using a 2% 
agarose matrix. After transfer of the DNA fragments from the 
agarose gels to nitrocellulose paper, the filters were hybri- 
dized to a 32 P random-primed 1.33 kb fragment associated with 

25 ET-NANB viral agent. The derivation and sequence of this 
fragment, which is carried in a pTZ-KFl (ET1.1) plasmid identi- 
fied by ATCC No. 67717. The cloned fragment was obtained by 
digesting the pTZ-KFl (ET1.1) plasmid with EcoR I, (ii) isolat- 
ing the released 1.33 kb ET-NANB fragment, and (iii) random 

30 priming the isolated fragment. 

The probe hybridization with the fractionated cDNA 
mixtures was performed by conventional Southern blotting 
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methods (Maniatis, pp. 382-389) . Figure 6 shows the hybridi- 
zation pattern obtained with cDNAs from infected and non- 
infected bile sources. As seen, the ET-NANB probe hybridized 
with amplified fragments obtained from the infected source, 
5 but was non-homologous to sequences obtained from the non- 
infected source. Similar results were obtained with amplified 
fragments obtained from infected and non-infected human stool 
sources. This procedure demonstrates the utility of the 
described invention since hybridization analysis of an equiva- 
10 lent amount of cDNA prior to amplification would not have 
yielded detectable hybridization signal. 



Example 5 

Isolation of Genomic DNA from the region adjacent the gene 

15 encoding IL-5 

A. Preparation of Large DNA fragments 

The hybrid cell line HHW105 (Overhauser et al.) was used 
as the source of genomic DNA. This cell line contains all of 
human chromosome 5. The human chromosome is retained by selec- 

20 tion for the leucine aminoacyl tRNA synthetase gene on human 
chromosome 5 which complements a corresponding defective 
hamster gene. This hybrid cell line was employed to obtain an 
immediate enrichment of chromosome 5 sequences. 

The gene encoding IL-5 is located in the'5q31.3 region 

25 of chromosome 5 (LeBeau et al) . Genomic DNA was isolated from 
cell line HHW105 (Maniatis et al, Smith et al) and digested 
with a variety of restriction endonucleases . The resulting 
genomic DNA fragments were size fractionated by Pulse-Field 
Gel Electrophoresis (Carle and Olson) . The DNA fragments in 

30 this gel were then transferee to nitrocellulose (Southern) and 
the nitrocellulose filter hybridized with a random-primed IL-5 
cDNA fragment (Azume et al) . A NotI fragment which hybridized 
with the IL-5 probe was identified. This 500 kilobase (kb) 
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Not I fragment was reduced by about 50 kb upon double digestion 
with Nrul ; this Notl/Nrul fragment was chosen as a target 
region for purification and subsequent use in homologous cDNA 
selection. The purification of this fragment is outlined in 
5 Figure 8 and described below. 

5 X 10 9 cells were harvested and washed once in Dulbec- 
co's Phosphate Buffered Saline (PBS). The cells were resu- 
spended in PBS at 1 X 10 8 cells/ml to which was added an equal 
volume of L-buffer (50 mM NaCl, 50 mM EDTA, pH=8) and 1% low 

10 gelling temperature (LGT) agarose (FMC Bioproducts) held at 
50°C. Proteinase K was then added to a final concentration of 
0.5 mg/ml. The mixture was gently pipetted into block formers 
and the blocks allowed to solidify. These blocks were then 
placed in 50 ml conical tubes, totally immersed in 40 mis of 

15 0.5 M EDTA, pH=9, 1% sarkosyl, and 200 jig/ml proteinase K, and 
then incubated overnight at 50°C. The 40 mis of solution was 
then replaced with L-buffer containing PMSF at a final con- 
centration of ImM and incubated for 2 hours at room tempera- 
ture; this wash was repeated. The blocks were then washed 

20 twice , with 2 hour incubations for each wash, in TE (10 mM 
Tris HC1, pH=8, 1 mM EDTA) at room temperature. 

B. The First CHEF gel. 

A 1% LGT agarose, 0.5 X TBE (5 X TBE stock solution, per 

25 liter: 54 g Tris Base, 27.5 g boric acid, and 20 mis 0.05 M 
EDTA (pH=8)) gel was poured containing 4 conventional wells 
and one long preparative well. Sixteen blocks of HHW105 DNA 
were equilibrated in 1 X Not I reaction buffer (New England 
Biolabs) for two incubations of one hour each. The equili- 

30 brated blocks were incubated with Not I at 37°C, for four 
hours, under the following digestion conditions: 16 blocks in 
15 ml corex tube, 200 *il 10 X Not! buffer, 20 p.1 10 mg/ml 
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acetylated bovine serum albumin (BSA) , 200 jil Not I enzyme (10 
units per jtl) , 1560 til water. The blocks were then treated 
for 2 hours in 1 mM PMSF. One individual block, lambda DNA 
ladder size markers (cl85757, FMC Bioproducts) and Schizosac- 
5 charomyces pombe chromosomal DNA (972, FMC Bioproducts) were 
loaded in the conventional wells. The remaining blocks were 
heated to 65°C, melted and loaded in the molten state into the 
preparative well. The gel was run under standard CHEF gel 
conditions (Smith et al., Chu et al.) for 48 hours at 200 V 

10 with 82 second switching intervals. The portion of the gel 
containing the 4 conventional lanes was then cut away from the 
preparative well and stained with ethidium bromide (see Figure 
9-A, First CHEF gel) . Since CHEF gel tracks are straight 
rather than curved, DNA fractions covering size ranges across 

15 the gel could be collected. The preparative portion of the 
gel was sliced, using a glass cover slip, into 1-2 mm slices 
covering the size range from approximately 1000 kb to 150 kb 
for a total of 30 slices down the gel. 

20 C. Identification of the IL-5 containing fraction 

Identification of the IL-5 containing fraction was 
achieved by sequence-specific amplification using IL-5 speci- 
fic primers (IL-5-5' = 5 ' -GGGAATTCATGAGGATGCTTCTGCATTTG-3 ' ; 
IL-5-3' = 5' -GGAAGCTTTCAACTTTCTATTATCCACTCGGTGTTCATTAC-3' ) . 

25 DNA was extracted from a 1 mm by 2 mm portion of each slice of 
the gel; the rest of each slice was stored until the fraction 
containing the IL-5-homologous fragment was identified. The 1 
X 2 mm sample was placed in a microfuge tube, heated to 65°C 
until the agarose was molten (approximately 5 minutes), mixed 

30 gently and a 20 nl aliquot was placed on a piece of Parafilm™ 
and allowed to solidify to an agarose bead. The agarose bead 
was placed in 200 |d of 1 X polymerase chain reaction buffer 
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(Perkin-Elmer Cetus Gene Amplification Kit) and equilibrated 
at room temperature for 30 minutes. . The buffer was then 
removed and the following standard reaction mixture added to 
the agarose bead: 16 jil 1.25 mM dNTPs, 10 ill 10 X polymerase 
5 chain reaction buffer, 5 \il 20 pM IL-5-3' primers, 5 \il 20pM 
IL-5-5', 63.5 ill water. The mixture -was heated to 65°C until 
the agarose was molten and then 0.5 jil of Taq I DNA polymerase 
was added. The mixture was briefly vortexed, a thin layer of 
mineral oil placed on top of the reaction mixture, and the 

10 amplification reaction run under the following conditions: 
denature for 1 minute at 94°C, anneal' for 1 minute at 50°C, 
extend for 2 minutes at 72°C, repeat for 25 cycles. Following 
the amplification, a conventional agarose gel was run to 
determine the fraction containing IL-5 (see Figure 9, polymer- 

15 ase chain reaction products) ; in parallel three control poly- 
merase chain reaction reactions were run on each gel: human 
DNA with IL-5 primers, hamster DNA with IL-5 primers, and IL-5 
primers alone. An example of a single CHEF gel track of Not I 
digested HHW105 genomic DNA and of the polymerase chain reac- 

20 tion products is shown in Figure 9. As seen in this figure, a 
single DNA sample contained the majority of the IL-5 fragment; 
this polymerase chain reaction product comigrated with the 
human DNA control polymerase chain reaction product. The 
hamster DNA and primers alone showed no polymerase chain 

25 reaction products. 

D. The Second CHEF Gel 

In order to achieve a further purification of the rele- 
vant fragment from the contaminating hamster fragments, the 
30 DNA in the slice corresponding to the IL-5-positive polymerase 
chain reaction product was subjected to digestion with the 
Nrul restriction endonuclease, and size fractionated on a CHEF 
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gel (as described above for Not I in Section B) . The gel was 
sliced as above and a small aliquot of DNA obtained from each 
slice was subjected to polymerase chain reaction amplification 
with the IL-5-specific primers to determine the positive 
5 fractions. The polymerase chain reaction amplification pro- 
ducts were resolved on a conventional agarose gel, the DNA 
molecules transferee to nitrocellulose, and, in this case, the 
IL-5-specific fraction was identified by hybridization with a 
radiolabeled IL-5 cDNA probe. The resulting autoradiogram is 
10 shown in Figure 10. 

E. Amplification of the IL-5 positive fraction by sequence- 
independent amplification . 

In order to have sufficient quantities of these genomic 
15 DNA fragments for use in cDNA library screening and for the 
construction of a library of these genomic fragments, the 
genomic DNA fragments were amplified using the sequence-in- 
dependent single primer amplification (SISPA) method described 
in Section C. To generate the blunt-ended molecules necessary 
20 for the linker addition, the DNA from the relevant size frac- 
tion which was identified in the second CHEF gel was digested 
with EcoRV, which generates blunt end cuts in the DNA at a 
hexanucleotide recognition site. . The digestion products were 
fractionated on a conventional 0.8% agarose gel with size 
25 markers in parallel. The gel was rotated through 90° and the 
digested DNA was electroeluted onto a charged membrane filter 
(NA45, Schleicher and Schuell) . The filter was cut into ten 
strips, five corresponding to the positions of DNAs of 2 kb or 
less in size, and five corresponding to DNAs above 2 kb in 
30 size. The DNAs were eluted from the filter in very small 
volumes (elution buffer: 1 M NaCl, 50 mM Arginine for one hour 
at 65 e C), phenol/chloroform extracted, and precipitated in the 
presence of carrier t-RNA. DNAs above 2 kb in length were 
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subjected to one more set of restriction endonuclease diges- 
tions using Haelll and Rsal; both of these enzymes generate 
blunt ends and recognize four base pair sites. In this way 
the larger DNAs were reduced to lengths that could be more 
5 efficiently amplified. Each of the ten fractions were ligated 
using T4 DNA ligase (New England Biolabs) to the A linkers 
(Figure 2-A) and were then cut with Nrul to eliminate any 
linker dimers which had been formed in the ligation reaction. 
The volume of the restriction endonuclease digestion reactions 

10 were increased to 100 nl. The fractions were then passed over 
Sephadex™ G-50 (Maniatis et al., page 466). 

The genomic fragments were then amplified, as described 
in Section C and Example 2 using Taq DNA polymerase I (Perkin- 
Elmer Cetus) and the A primer under the following conditions: 

15 25 \il of the linkered fraction, 5 of 20 pM A primer, 10 |xl 
10 X amplification reaction buffer (100 roM Tris HC1, pH=8.3, 
500 mM KC1, 15 mM MgCl 2 , 0.1% (w/v) gelatin), 16 ^1 1.25 mM 
dNTPs, 43.5 \il water, 0.5 \il Taq I DNA polymerase. The reac- 
tion mixture was cycled 25 times for the following periods: 

20 denature at 94°C for 1 minute, anneal at 50°C for 1 minute, and 
extension at 72°C for 2 minutes. An aliquot of each amplified 
fraction was separated on a 1% agarose gel which is shown in 
Figure 11. A smear of amplified products was seen. Carrier 
t-RNA treated in parallel does not amplify. The furthest most 

25 right hand lane of Figure 11 is the size standard lambda 
Hindlll. The DNA in this gel (the final SISPA gel) was trans- 
ferred to a GeneScreen™ filter (DuPont/New England Nuclear) , 
the DNA cross-linked to the filter by UV, and the filter pre- 
hybridized overnight at 65°C in GeneScreen™ buffer (Church and 

30 Gilbert) with the addition of 100 ]ig/ml of sheared herring 
sperm DNA (the final SISPA filter, see Example 7) . 
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An aliquot of the amplified genomic fragments (not bound 
to nitrocellulose) was used for the construction of a fragment 
library. 

5 Example 6 

Preparation of the T-cell cDNA library 
T-cells were separated from whole blood by standard 
methods and the leu3+ fraction specifically selected. The 
leu3+ T-cells were induced with PMA (phorbol myristate ace- 
10 tate) by standard procedures. RNA was extracted from the T- 
cells 6 hours post-induction with PMA, and the poly-A + RNA 
fraction was purified from T-cell total RNA by affinity chro- 
matography on an oligo-dT cellulose column. A leu 3+ T-cell 
cDNA library was prepared essentially as described by Maniatis 
15 et al (pages 229-246) . Synthesis of the first strand was 
achieved using an oligo-dT primer and reverse transcriptase. 
Second strand cDNA synthesis was performed by a combination of 
the Klenow fragment of DNA polymerase I, followed by treatment 
with reverse transcriptase. Hairpin structures were resolved 
20 with S x nuclease. EcoRI linkers were blunt end ligated onto 
the newly synthesized duplex cDNA molecules and these mole- 
cules were then ligated into the lambda gtlO vector for clon- 
ing purposes (Protoclone™ lambda gtlO, Promega) . 

A 20 \ig aliquot of the T-cell DNA library DNA was diges- 
25 ted with EcoRI to liberate the cDNA inserts from the phage 
arms (approximately 4% of the total weight is insert cDNA) . 
The EcoRI ends were then converted to blunt ends by filling in 
with dATP and TTP, using the Klenow fragment of DNA polymerase 
I (Maniatis et al., page 113). The use of only dATP and TTP 
repairs only the EcoRI sites but not the cos sites of lambda, 
thus avoiding the potential problem of creating blunt end 
termini on both ends of the contaminating phage vector DNA. 
The C linker (Figure 2-C) was then ligated to this total cDNA 
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library and the ligation mixture digested with Nrul to remove 
dimers. The volume of the reaction mixture was then increased 
to 1.0 ml with GeneScreen™ buffer, sheared lambda gtlO DNA 
added to a final concentration of 100 (ig/ml, and the mixture 
5 incubated at 65°C for 90 minutes; this mixture is the probe 
used in Example 7. 

In addition, this preparation of the cDNA library may be 
pre-hybridized to randomly-sheared total human genomic DNA, 
before hybridization to the final SISPA filter, to pre-block 

10 repetitive sequences present in the cDNA library and thus 
reduce the background/ signal noise (see Section ii-b(III)). 
The cDNA library can be amplified using the Olinker sequence 
as an end-terminal primer for amplification (Example 7) • 
Alternatively, before the cDNA molecules are liberated from 

15 the lambda gtlO vector, common vector sequences flanking the 
5' and 3' end of the cDNA inserts can be used as end-terminal 
primers to amplify the cDNA inserts. 



Example 7 

20 Hybridization of the T-cell cDNA library to the Chromosome 5 

derived genomic DNA 
A. Hybridization and Washing 

The T-cell cDNA library probe constructed in Example 6 
was added to the final SISPA filter prepared in Example 5 in a 
25 total volume of 10 mis of GeneScreen™ buffer. The hybridiza- 
tion was carried out for 48 hours at 65°C. The filter was 
then extensively washed under the following conditions: 
1. Two washes with 1 X SSC / 0.1% SDS at room temperature 
for 15 minutes each. 
30 2. Two washes with 1 X SSC / 0.1% SDS at 65°C for 15 min- 
utes each. 

3. Two washes with 0.1 X SSC / 0.1% SDS at 65°C for 15 
minutes each. 
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4. A final wash with 0.1 X SSC / 0.1% SDS at 65°C for 20 
hours . 

B. Elution of the selected cDNAs from the filter 

5 After the final wash, the selected cDNAs were eluted 

from the filter by treatment with 5 mis of 50 mM NaOH at room 
temperature. This eluent was neutralized to P H=8.0 with 
approximately 2 mis of 1.0 M Tris HC1. The cDNAs were precipi- 
tated with NaCI, yeast RNA, and ethanol, and then resuspended 
10 in 100 m of water. The resuspended cDNAs were then passed 
over a Sephadex™ G-50 column. 

C. Amplification and Analysis of the selected cDNAs 

The cDNAs present in the eluent were then amplified 
15 using the C primer and the following conditions: 30 m eluted 
material, 10 ul 10 X amplification reaction buffer, 16 |il 1.25 
mM dNTPs, 5 ul 20 uM C primer, 45 ul water, 0.5 ul Tag I DNA 
polymerase. The amplification reaction was carried out for 25 
cycles as follows: denature at 94°C for 1 minute, anneal at 
20 45°C for 1 minute, and extend at 72 e C for 2 minutes. The 
amplification products were run on a conventional agarose gel 
to examine the products (Figure 12) . The DNA in the gel was 
transferee to a nitrocellulose filter and the filter hybri- 
dized with 6 radiolabeled probes: actin, hamster genomic DNA, 
25 human genomic DNA, IL-3 specific (Yang et al.), IL-5 specific 
(Azume et al.), and human mitochondrial DNA specific (CF14) 
(Figure 13) . 
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Selection of cDNAs corresponding to the genomic region 
containing IL-3 and GMCSF 
The methods applied to the analysis of IL-5 have also 
5 been applied to IL-3. Specific probes for IL-3 (Yang et al.) 
and GM-CSF (Schrader et al.) are known. Using an IL-3 specific 
probe the IL-3 containing region of chromosome 5 was isolated 
as described above; a long-range restriction map of this 
region is shown in Figure 15. By probing isolated genomic 

10 clones with labelled probes specific for IL-3 and GM-CSF we 
showed a physical linkage between these loci of less than 10 
kilobases. T-cell cDNAs were then selected on the basis of 
their homology to the selected IL-3 containing region genomic 
DNA region. Figure 14 shows the hybridization of these selec- 

15 ted cDNAs with probes specific for IL-3, GM-CSF, and actin, 
and also a total human genomic probe. Specific hybridization 
to IL-3 and GM-CSF was clearly visible. The lack of hybridiza- 
tion to actin underscores that the hybridization to IL-3 and 
GM-CSF was specific and the lack of hybridization to total 

20 human genomic DNA indicates the absence of large numbers of 
repetitive sequence DNA in the selected cDNAs. 



Example 9 

Isolation of cDNAs derived from 10C9 mRNAs which are 
25 homologous to the Bovine Leukosis Virus genome 

A. Preparation of the Genomic Bovine Leukosis Virus DNA 

The cloning and sequencing of the bovine leukosis virus 
(BLV) genome have been previously described (Kashmiri et al., 
30 Sagata et al.). Full length molecular DNA clones of BLV cor- 
responding to the RNA genome of the virus were obtained from 
J.F. Ferrer, University of Pennsylvania, School of Veterinary 
Science. 100 nanograms of BLV DNA was heat denatured to yield 
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single-strands and spotted to a 0.45 micron nitrocellulose 
filter (Schleicher and Schuell) . The filter was then baked 
for 2 hours at 75° under vacuum to fix the DNA to the filter 
(Ricciardi et al.). The following solution was used for pre- 
5 hybridization of the filter: 50 mM Tris HC1, pH=7.5; 1 mM 
EDTA; 0.5% SDS; 3 X SSC (20 X SSC stock solution = 175.3 g 
NaCl, 88.2 g sodium citrate, adjusted to pH=7, having a total 
volume of 1 liter of water) / 1 x FPB (10 X FPB stock solution 
- 0.2% Ficoll, 0.2% polyvinylpyrrolidone, and 0.2% bovine 
10 serum albumin) ; 50% Formamide; and, 100 ug/ml denatured salmon 
sperm DNA. Pre-hybridization was carried out for 20 hours at 
37°C. 



B. Preparation of the Human Lymphoma-associated Virus cDNA 
15 from cell line 10C9 

10C9 virus particles were purified from 10C9 cell cul- 
ture supernatants which were clarified of cellular debris by 
centrifugation (10,000 X g, 15 minutes at 4°C) , concentrated 
100-fold by Amicon™ ultrafiltration, and pelleted through a 
20 25% sucrose cushion/PBS (100,000 X g, 4°C for 90 minutes). 
The resultant pellet was resuspended in Dulbecco's Phosphate 
Buffered Saline (PBS), homogenized, and overlaid on a 15-60% 
sucrose density gradient, centrifuged for 16 hours at 100,000 
X g, 4°C; the virus band at 1.15-1.18 g/ml was collected, 
25 diluted with PBS and pelleted by ultracentrifugation to remove 
the sucrose (100,000 X g, 4°C for 90 minutes). 

The viral RNA was extracted from the particles by the 
method of Chomzynski et al. and was then used as template in 
random-primed cDNA synthesis (kit obtained from Boerhinger- 
30 Mannheim). The cDNA molecules were blunt-ended using the 
Klenow fragment of DNA polymerase I and all four deoxyribo- 
nucleotides (Maniatis et al., page 113). The A linkers (see 
Figure 2-A) were then ligated to the ends of the cDNA mole- 
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cules using T 4 DNA ligase (New England Biolabs) . These cDNA 
molecules were then digested with Nrul to eliminate any linker 
dimers which had been formed in the ligation reaction. 
The 10C9 cDNA library was then amplified using the sequence- 
5 independent single primer amplification method described in 
Examples 2 and 6. 

C. Hybridization of the 10C9 cDNA library to the BLV genome 
The pre-hybridized filter described above was then 

10 hybridized for 36 hours at 37°C with the 10C9 cDNA library 
under the following conditions: 50 mM Tris HC1, pH=7.5; 1 mM 
EDTA; 0.5% SDS; 3 X SSC (20 X SSC stock solution = 175.3 g 
NaCl, 88*2 g sodium citrate , adjusted to pH=7, having a total 
volume of 1 liter of water) ; 1 X FPB (10 X FPB stock solution 

15 = 0.2% Ficoll, 0.2% polyvinylpyrrolidone, and 0.2% bovine 
serum albumin) ; 50% Formamide; 10% Dextran Sulfate; and, 100 
Hg/ml of the 10C9 amplified cDNA library. 

After the hybridization period the filter was transferee 
to a clean 1.5 ml microtube and the following series of washes 

20 performed: 

1. 100 |il of 2 X SSC and 0.5% SDS, at room temperature for 
fifteen minutes. 

2. Repeat step 1. 

3. 100 |il 2 X SSC and 0.1% SDS, at 60°C for 30 minutes. 
25 4. Repeat step 3. 

5. 100 |il 0.5 X SSC and 0.1% SDS at 60°C for 30 minutes. 

6. Repeat step 5. 

7. Elution of the bound cDNAs was achieved by washing with 
100 |il of 0.1 X SSC and 0.1% SDS at 95-10Q°C for 15 

30 minutes . 

8. Repeat step 7. 
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The eluents were run on a conventional agarose gel, 
transferred to nitrocellulose and hybridized with an IL-5 spe- 
cific probe (Campbell et al; Azume et al) ; elution of IL-5 
specific sequences was seen only in the eluent of steps 7 and 
5 8. 

The eluents were amplified as described in Example 7C 
except using the A primer. 

Example 10 

10 Characterization of 10C9 Virus cDNAs homologous to the BLV 

genome 

The amplified selected 10C9 cDNAs present in the eluents 
of Example 9 were then cloned to create two libraries: one in 
lambda gtlO (Protoclone™ lambda gtlO system, Promega) and one 

15 in lambda gtll (Protoclone™ lambda gtll system, Promega) . The 
linkers used in the construction of the amplified cDNAs also 
contained an EcoRI site (see Figure 2) which allowed for 
direct insertion of the amplified cDNAs into both lambda gtlO 
and gtll vectors. 

20 The fusion proteins expressed by the lambda gtll clones 

were screened with antibody probes (Young et al . ) . Screening 
with bovine anti-BLV (cc-BLV) detected hundreds of positive 
clones from 4 X 10* plaques screened. Screening with <x-p24 
detected 15 positive plaques out of 10 4 screened, these posi- 

25 tives were plaque purified and cross -screened with o- Blv , 4/15 
of these clones were positive with both anti-sera. Screening 
with o-gp62 detected 4 positive clones which were all also 
positive when cross-screened with a-BLV. 

The lambda gtlO cDNA library was screened with a full 

30 length molecular clone of BLV. Plaque hybridization screening 
(Woo) of lambda gtlO clones containing approximately 600 base 
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pair inserts yielded 76 positives out of 2 X 10 4 plaques 
screened. 

While preferred embodiments, uses, and methods of prac- 
ticing the present invention have been described in detail, it 
5 will be appreciated that various other uses, and methods of 
practice as indicated herein are within the contemplation of 
the present invention. 
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WHAT IS CLAIMED IS: 

1. A method of cloning cDNA species which are homolo- 
gous to a region of contiguous genomic DNA and are selected 
5 from a mixture of cDNA species, said, method comprising, 

(a) obtaining the region of- interest of contiguous 
genomic DNA, 

(b) preparing cDNA species to contain end-terminal 
priming sequences, 

10 < c > isolating single-stranded cDNA species on the basis 

of their hybridization to the region of contiguous genomic 
DNA, 

(d) mixing the isolated single-stranded cDNA species 
with DNA polymerase, all four deoxyribonucleotide triphos- 

15 phates, and primers homologous to the cDNA end-terminal prim- 
ing sequences, 

(e) reacting the mixture under conditions to produce 
sequence-independent amplification of the single-stranded cDNA 
species, and 

20 < f > cloning the amplified cDNA species into a vector. 

2. The method of claim 1, wherein the region of con- 
tiguous DNA contains a region of interest, and said obtaining 
includes (a) size fractionating a DNA section containing the 
25 region of interest, (b) identifying the fraction containing 
the region, (c) isolating the DNA from the identified frac- 
tion, and (d) digesting the isolated DNA with a restriction 
endonuclease, to cut the region of interest into a plurality 
of smaller fragments. 

30 
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3. The method of claim 2, wherein the DNA containing 
the region of interest is identified in the fraction by use of 
primer-initiated amplification in the presence of primers 
which are homologous region ♦ 

5 

4. The method of claim 1, wherein the contiguous geno- 
mic DNA section is bound to a solid support. 

5. The method of claim 1, wherein the contiguous geno- 
10 mic DNA section is contained in yeast artificial chromosomes. 

6. The method of claim 1, wherein the contiguous geno- 
mic DNA section represents the bovine leukosis virus genome 
and the mixture of cDNA species is a cDNA library made from 

15 mRNA obtained from a cell line infected with a virus derived 
from cell line 10C9. 

7. The method of claim 1, for use in identifying cDNAs 
which correspond to genes located in the same chromosomal 

20 region as a known gene, wherein fragments containing said 
contiguous genomic DNA region are obtained by, 

(a) preparative size fractionating of genomic DNA 
fragments containing the known gene, 

(b) ligating linkers to the genomic DNA fragments, 
25 containing the known gene, which are useful as primers for 

sequence-independent amplification, 

(c) mixing the DNA fragments with DNA polymerase, all 
four deoxyribonucleotides, and primers homologous to the 
linkers present on the ends of the DNA fragments, and 

30 (d) reacting the mixture under conditions to produce 

sequence-independent amplification of the DNA fragments. 
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8. The method of claim 7, wherein the preparative size 
fractionating of genomic DNA fragments containing the known 
gene involves identification of the DNA fragments of interest 
by adding primers homologous to the known gene, DNA polymer- 
ase, and all four deoxyribonucleotides, to the gel matrix and 
treating the gel matrix under conditions which promote amplif- 
ication of the region of the known gene defined by the pri- 
mers. 



9. The method of claim 1, wherein the known gene is a 
growth factor or growth factor receptor gene. 

10. The method of claim 9, wherein the known gene is an 
interleukin . 

11. The method of claim 10, wherein the known gene is 
IL-5 and the mixture of cDNA species is a T-cell cDNA library. 

12. The method of claim 9, 'wherein the known gene is 
erythropoietin and the mixture of cDNA species is a renal cDNA 
library . 



13. The method of claim 1, wherein said preparing of 
cDNA species to contain end-terminal priming sequences further 

25 includes the amplification of the cDNA species by mixing the 
single-stranded cDNA species with DNA polymerase, all four 
deoxyribonucleotide triphosphates, and primers homologous to 
the cDNA end-terminal priming sequences, and reacting the 
mixture under conditions to produce sequence-independent 

30 amplification of the single-stranded cDNA species. 
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14. The method of claim 1, wherein said preparing of 
cDNA species to contain end-terminal priming sequences in- 
cludes the cloning of the cDNA species into a suitable vector 
and using the known 5' and 3' vector sequences , which flank 

5 the cDNA insert, as end-terminal priming sequences. 

15. The method of claim 14 , wherein said preparing 
further includes the amplification of the cDNA species by 
mixing the single-stranded cDNA species with DNA polymerase, 

10 all four deoxyribonucleotide triphosphates, and primers homol- 
ogous to the cDNA end-terminal priming sequences, and reacting 
the mixture under conditions to produce sequence-independent 
amplification of the single-stranded cDNA species. 

15 16. A method of cloning cDNA species which are homolo- 

gous to a region of contiguous genomic DNA and are selected 
from a mixture of cDNA species, said method comprising, 

(a) preparing cDNA species to contain end-terminal 
priming sequences, 

20 (b) isolating single-stranded cDNA species on the basis 

of their hybridization to a first set of selected genomic 
fragments, 

(c) mixing the isolated single-stranded cDNA species 
with DNA polymerase, all four deoxyribonucleotide triphos- 

25 phates, and primers homologous to the cDNA end-terminal prim- 
ing sequences, 

(d) reacting the mixture under conditions to produce 
sequence-independent amplification of the single-stranded cDNA 
species, 

30 (e) isolating single-stranded cDNA species, from the 

mixture of step (d) , on the basis of their hybridization to a 
second set of selected genomic fragments, 
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(f) mixing the isolated single-stranded cDNA species of 
step (e) with DNA polymerase, all four deoxyribonucleotide 
triphosphates, and primers homologous to the cDNA end-terminal 
priming sequences, 

(g) reacting the mixture under conditions to produce 
sequence-independent amplification of the single-stranded cDNA 
species, 

(h) cloning the amplified cDNA species into a vector. 
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